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TTic invention concerns a method for creating, identifying, and isolating ribozymes capable of binding a selected ligand and catalvzine 
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NOVEL RIBOZYMES AND NOVEL RIBOZYME SELECTION SYSTEMS 
Background of the Invention 
5 This Invention relates to novel ribozyne molecules 

and methods for their identification and isolation. 

This invention was made with Government support 
under Contract #R01-GM453 15-02 awarded by the National 
Institutes of Health. The Government has certain rights 
10 in this invention* 

Both the genetic and enzymatic components of the 
earliest cells eure thought to have been RNA molecules, 
because RNA is the only known macromolecule that can both 
encode information in a heritable form, and act as a 
15 biocatalyst (Joyce, Nature 338:217, 1989). It has been 
proposed that modern metabolism evolved prior to the 
evolution of encoded protein synthesis, and that early 
ribozyme-catalyzed metabolic transformations form the 
basis of our present protein-catalyzed metabolism (Benner 
20 et al., Proc. Natl. Acad, Sci. USA 86:7054, 1989) . This 
proposal requires that ribozymes should be able to 
catalyze a broad range of chemical transformations. 
However, to date, known natural ribozymes, including the 
group I and group II introns, RNAse P, and the hammerhead 
25 and hairpin RNAs, have been shown to catalyze only a 
restricted range of reactions involving the RNA sugar- 
phosphate backbone (Wilson and Szostak, Curr. Ooin. 
gt;rupt:, Biol. 2:749, 1992). 

Summary of the Invention 
30 The invention concerns a method for creating, 

identifying, and isolating catalytic RNA molecules 
capable of binding a ligand and catalyzing a reaction 
modifying the catalytic RNA (or other substrate) . The 
method entails sequential selections for ligand binding 
35 RNA molecules and catalytic RNA molecules. 
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The ribozymes isolated by the method of the 
invention are capable of catalyzing reactions normally 
catalyzed by enzymes. Previously, the art disclosed 
ribozymes capable of catalyzing reactions involving the 
5 RNA sugar-phosphate backbone, e.g., phosphodi ester 

transfer reactions and hydrolysis of nucleic acids. The 
methods of the invention can be used to create ribozymes 
capable of carrying out reactions on the RNA sugar- 
phosphate backbone. In addition, however, ribozymes 

10 created by the method of the invention can catalyze 

reactions other than hydrolysis and transesterif ication, 
thereby increasing the range of systems for which the 
catalytic ribozymes and the catalytic ribozyme selection 
systems of the invention are useful. 

15 The methods of the invention entail sequential in 

vitro selections using pools of RNA molecules which 
include one or more regions of random sequence. Because 
catalysis of a complex reaction demands both the ability 
to bind a non-RNA ligand and the preferential 

20 stabilization of the transition state configuration of 
the reactants, the number of functional ribozymes in a 
pool of RNA having one or more regions of random sec[uence 
may be vanishingly small. The methods of the invention 
overcome this difficulty through the use of sequential 

25 selections. The method of the invention entails at least 
two selections steps: a binding selection step for 
identifying in a pool of random RNA molecules those RNA 
molecules which are capable of binding the selected 
ligand and a catalysis selection step for identifying in 

30 a pool of substrate binding RNA molecules (or sequence 
variants of such RNA molecules) those which are capable 
of catalyzing a reaction which modifies the catalytic RNA 
(or other substrate) . After each selection step, an 
amplification step is performed. In this amplification 

35 step, the selected molecules are amplified using PGR. Of 
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course, as explained more fully below, the binding 
selection step and the catalysis selection step may 
Include one, two, or more rounds of selection and 
amplification. After each round, the pool of molecules 
5 Is enriched for those having the desired binding or 
catalysis activity* Thus, the methods of the invention 
effectively entail three steps: 1) selection of BNA 
molecules capable of binding a chosen llgand from a pool 
of RNA molecules having a region of random sequence; 2) 

10 generation of a pool of RNA molecules which have a llgand 
binding sequence which is based on the identified llgand 
binding secpience of llgand-blndlng RNA molecules selected 
in step 1 as well as a region of random sequence; and 3) 
selection of RNA molecules exhibiting catalytic activity 

15 which modifies the RNA molecule itself or a substrate 

attached to the catalytic RNA* To Identify catalytic RNA 
molecules one must tag the active molecules so that they 
may be partitioned from the inactive ones. This tagging 
is most straightforward when the reaction catalyzed by 

20 the RNA molecule modifies the catalytic RNA molecule 

itself. This modification can Involve the formation of a 
chemical bond, the breaking of a chemical bond, or both. 
Often the modification attaches one or more new atoms to 
the RNA. Other desirable modifications remove one or 

25 more atoms from the RNA. To be useful for tagging the 
modification must render the modified molecules 
distinguishable from non-modified molecules. Tagging can 
also be accomplished by modification of a substrate 
attached to the catalytic RNA molecule. If all of the 

30 molecules in the pool are attached to a substrate 
molecule, those RNA molecules which can catalyze a 
reaction modifying the attached sustrate can be 
partitioned from the RNA molecules which do not carry out 
the modification. 
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Of course, one nay find that a pool of catalytic 
RNA molecules is capable of carrying out a ntimber of 
different modifications. 

The selected ligand can include small molecules 
5 such as drugs, metabolites, cof actors, toxins, and 

transition state analogs. Possible ligands also include 
proteins, polysaccharides, glycoproteins, hormones, 
receptors, lipids, and natural or synthetic polymers. 
Preferably, for therapeutic applications, binding of the 

10 ligand and catalysis takes place in aqueous solution 
under physiological or near physiological salt 
conditions, temperature, and pH. 

It is important to note that the ligand used to 
identify ligand-binding RNA molecules may be, but does 

15 not have to be, the same ligand which is used in the 
catalyst selection step. One may wish, for example, to 
isolate ligand-binding RNA molecules using a first ligand 
(e.g., ATP) and then isolate catalytic RNA molecules with 
a second ligand (e.g., ATP-y-S) which can bind to the 

20 same ligand binding region. 

As mentioned above, the method of the invention 
entails at least two selection steps. In the first step, 
RNA molecules capable of binding the chosen ligand are 
selected from a pool of RNA molecules which include one 

25 or more regions of random sequence. In the second 
selection step, RNA molecules capable of catalyzing a 
reaction modifying the RNA (or other substrate) are 
chosen from a second pool of random RNA molecules whose 
sequence is based on the sequence of one or more ligand 

30 binding RNAs identified in the first selection step. 

"Random RNAs" and "random sequence" are general 
terms used to describe molecules or sequences which have 
one or more regions of "fully random sequence" and/or one 
or more regions of "partially random sequence." Such 

35 molecules may also include one or more regions of 
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"defined sequence." "Fully random sequence" is sequence 
in vAiich there is a roughly equal probability of each of 
A, T, and G being present at each position in the 
sequence. Of course, the limitations of some of the 
5 methods used to create nucleic acid molecules make it 
rather difficult to create fully random sequences in 
vhich the probability of each nucleotide occurring at 
each position is absolutely equal. Accordingly, 
sequences in vhich the probabilities are roughly equal 
10 are considered fully random sequences. In "partially 
random sequences" and "partially randomized sec[uences," 
rather than there being a 25% chance of each of A, T, C, 
and G being present at each position, there are unequal 
probabilities. For example, in a partially random 
15 sequence, there may be a 70% chance of A being present at 
a given position and a 10% chance of each of T, c, and G 
being present. Further, the probabilities can be the 
same or different at each position within the partially 
^randomized region. Thus, a partially rfimdom sequence may 
20 include one or more positions at which the sequence is 
fully random and one or more positions at which the 
sequence is defined. Such partially random sequences are 
particularly useful when one wishes to make variants of a 
known sequence. For example, if one knows that a 
25 particular 20 base sequence binds the selected ligand and 
that positions 2, 3, 4, 12, 13, and 15-20 are critical 
for binding, one could prepare a partially random version 
of the 20 base sequence in which the bases at positions 
2, 3,4, 12, 13, and 15*20 are the same as in the known 
30 ligand binding sequence and the other positions are fully 
randomized. Alternatively, one could prepare a partially 
random sequence in which positions 2, 3, 4, 12, 13, and 
15-20 are partially randomized, but with a strong bias 
towards the bases found at each position in the original 
35 molecule, with all of the other positions being fully 
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randomized. This type of partially random sec[uence is 
desirable in pools o£ molecules from which catalytic RNAs 
are being selected* 

As discussed below, the sequence of any randomized 
5 region may be further randomized by mutagenesis during 
one or more amplification steps as part of a process 
referred to as In vitro evolution. 

It is desirable to have one, preferably two, 
regions of **defined sequence'*. Defined sequence is 

10 sequence selected or known by the creator of the 

molecule. Such defined sequence regions are useful for 
isolating and amplifying the nucleic acid because they 
are recognized by defined complementary primers. The 
defined primers can be used to isolate or amplify 

15 sequences having the corresponding defined sequences. 
The defined sequence regions preferably flank the 
randomized regions. The defined region or regions can 
also be intermingled with the randomized regions. Both 
the random and specified regions can be of any desired 

20 length. 

In the first step, nucleic acids capable of 
binding the ligand are identified. Beginning with a pool 
of nucleic acids which include one or more regions of 
random sequence, the method for isolating ligand-binding 

25 molecules includes contacting the pool of nucleic acid 
with the substrate under conditions which are favorable 
for binding, partitioning nucleic acids which have bound 
the substrate from those which have not, dissociating 
boxind nucleic acids and substrate, amplifying the nucleic 

30 acids (e.g., using PGR) which were previously bound, and, 
if desired, repeating the steps of binding, partitioning, 
dissociating, and amplifying any desired number of times. 

Several cycles of selection (binding, 
partitioning, dissociating, and amplifying) are desirable 

35 because after each round the pool is more enriched for 
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substrate binding nucleic acids. One can perform 
additional cycles of selection until no substantial 
improvement in substrate binding is observed. Of course, 
one can also perform far fever cycles of selection. 
5 In many cases, sequencing of nucleic acids 

Isolated after one or more rounds of partitioning and 
amplification will reveal the presence of a number of 
different nucleic acids. One or more of these nucleic 
acids can be used in the pool of nucleic acids from which 
10 catalytic nucleic acids are Isolated in the second 

selection of the method of the invention. Alternatively, 
the pool for the second phase can be composed of one or 
more nucleic acids having sequences based on the 
sequences of the nucleic acids identified in the binding 
l5_selection. For example, sequencing of the nucleic acids 
which bind the substrate may suggest one or more regions 
of consensus sequence, i.e., sequences which appear to be 
important for binding. The pool of molecules used for 
selection of catalytic molecules may then include nucleic 
20 acids whose sequence is based on this consensus sequence. 
One may also employ a partially randomized sequence based 
on the consensus sequence. This may permit the isolation 
of improved binding domains. It can also permit 
alterations of the binding domain which may be desirable 
25 for improved catalysis. Of course, as discussed above, 
the degree of randomization of the consensus sequence is 
generally quite low. The consensus sequence region, 
randomized or not, may be Interspersed with and/ or 
flanked by additional randomized regions. Thus, the 
30 sequences of the molecules in the pool of nucleic acids 
molecules used in the catalysis selection step can differ 
from that of the molecule (s) identified in the substrate 
selection step as molecules capable of binding the 
desired substrate. 
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Those Skilled in the art can readily identify 
ligand'binding consensus sequences by sequencing a number 
of ligand-binding RNA molecules and comparing their 
sequences. In some cases such sequencing and comparison 
5 will reveal the presence of a number of different classes 
of ligand binding sequences (aptamers) . In these 
circumstances it may be possible to identify a core 
sequence which is common to most or all classes. This 
core sequence or variants thereof can be used as the 

10 starting point for the catalysis selection. By ** variant" 
of a ligand binding sequence is meant a secjuence created 
by partially randomizing a ligand binding sequence. 

The size of the randomized regions employed should 
be adec(uate to provide a substrate binding site in the 

15 case of the binding selection step. Thus, the randomized 
region used in the initial selection preferably includes 
between 15 and 60 nucleotides, more preferably between 20 
and 40 nucleotides. The randomized region or regions 
used for the catalysis selection step should be of 

20 sufficient length to provide a reasonable probability of 
being able to include catalytic activity. 

The probability that any given RNA sequence of 30, 
50, 100, or even 400 bases includes a region capable of 
binding a chosen substrate is very low. Similarly, the 

25 probability that a given RNA sec[uence which includes a 
region capable of binding a chosen substrate also has a 
region capable of catalyzing a reaction involving the 
chosen substrate is very low* Because of this each, of 
the two selection steps preferably begins with a pool of 

30 molecules which is large enough and random enough to 

include molecules which can bind the chosen substrate in 
the case of the binding selection or catalyze a reaction 
involving the chosen substrate in the case of the 
catalysis selection. Accordingly, the molecules used in 

35 each initial pool include at least one fully random 
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sequence region. Binding sites may occur at a frequency 
of 10"^^ to 10"^^ in random sequences. Thus, pool sizes 
are preferably greater than 10^^. 

It is generally not practical to prepare a 
5 population of molecules which includes all of the 
possible sequences of a particular random sequence. 
However, even where one has a population of no more than 
10^5 different molecules out of 10^^ potential sequences, 
one can isolate molecules having a desired binding or 
10 catalytic activity. 

The catalysis selection step involves identifying 
RNAs which catalyze a reaction involving the chosen 
ligand. The pool of molecules used at the outset of this 
selection step generally is composed of molecules having 
15 one or more defined or partially randomized sequences 
which are designed to bind to the chosen ligand ("ligand 
binding region") as well as a second random sequence 
region, preferably fully randomized which serves as the 
source of potentially catalytic sequences. The ligand 
20 binding region included in the molecules in this 

catalysis selection pool can have a sequence which is 
identical to an identified ligand binding sequence 
identified in the binding selection phase. Alternatively 
the sequence of this region can be based on the consensus 
25 sequence of a number of substrate binding regions 

identified in the first step. The region may also be a 
partially randomized sequenced based on either a 
particular substrate binding sequence or substrate 
binding consensus sequence. Of course, the molecules 
30 also preferably include one or more defined sequence 
regions which can bind isolation or amplification 
primers. 

In order to identify molecules having catalytic 
activity there must be a means for partitioning those RNA 
35 molecules which have catalyzed a reaction modifying the 
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RNA molecule (or a substrate attached to the RNA) from 
those which have not* The selection can be accomplished 
using affinity columns which will bind modified, but not 
unmodified molecules. Alternatively, one can employ an 
5 antibody which recognizes the modified, but not 

unmodified molecules. It is also possible to chemically 
convert modified, but not unmodified ligand, to a 
compound which will bind selectively to an affinity 
col\imn or other selective binding material (e.g., an 

10 antibody) . 

In many cases the catalytic HNA will itself be 
chemically altered (modified) by the reaction it 
catalyzes. This alteration can then form the basis for 
selecting catalytic molecules. 

15 In many cases it may be possible to alter such 

catalytic RNA molecules so that instead of being self** 
modifying they modify a second molecule. 

As will be apparent from the examples below there 
are a number of means for partitioning catalytic 

20 molecules from non-catalytic or less catalytic molecules. 

It may be desirable to increase the stringency of 
a selection step in order to isolate more desirable 
molecules. The stringency of the binding selection step 
can be increased by decreasing ligand concentration. The 

25 stringency of the catalysis selection step can be 

increased by decreasing the ligand concentration or the 
reaction time. 

One can covalently link a molecule to be modified 
to RNA so that catalytic RNA molecules can be isolated by 

30 isolating the modified molecule. For example, one might 
wish to find RNAs capable of oxidizing compound A. This 
might be accomplished by isolating RNA molecules capable 
of binding a redox co-factor (NAD, FAD, or NADP) . A pool 
of random RNAs is then created which are capable of 
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binding the cof actor • Compound A is then covalently 
attached to the RNA molecules in this pool and a 
selection is carried out which isolates molecules having 
the oxidized form of compound A. Methods for linking 
5 various compounds to RNA are well known to those skilled 
in the art and include the use of a thiophosphate group 
and the use of amines linked via a 5' phosphate. 

Of course, in some cases a catalytic RNA which is 
capable of self-modification or modification of an 
10 attached substrate may also be able to perform the 

"trans" reaction. Such trans acting molecules modify an 
RNA other than themselves or modify the substrate even 
when it is not attached to the catalytic RNA. 

In one aspect, therefore, the invention features a 
15 method for producing a catalytic RNA molecule capable of 
binding a first ligand and catalyzing a chemical reaction 
modifying the catalytic RNA molecule. The method 
includes the following steps: 

a) providing a first population of RNA molecules 
20 each having a first region of random sequence; 

b) contacting the first population of RNA 
molecules with the first ligand; 

c) isolating a first ligand-binding 
subpopulation of the first population of RNA molecules by 

25 partitioning RNA molecules in this first population which 
specifically bind the first ligand from those which do 
not; 

d) amplifying the first ligand-binding 
subpopulation In vitro ? 

3^ identifying a first ligand binding sequence; 

f ) preparing a second population of RNA 
molecules each of the RNA molecules including the first 
ligand binding sequence and a second region of random 
sequence; 
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g) contacting the second population of RNA 
molecules with a second llgand capable of binding the 
first ligand binding sequence; and 

h) isolating a subpopulation of the catalytic 

5 HNA molecules from the second population of RNA molecules 
by partitioning RNA molecules which have been modified in 
step g) from those which have not been modified. 

In various preferred embodiments of the method, 
the first ligand is ATP, the first ligand is biotin, the 

10 second ligand serves as a substrate for the chemical 

reaction, and the first and second ligands are the same* 
In other preferred embodiments of the method, the 
catalytic RNA molecule can transfer a phosphate from a 
nucleotide triphosphate to the catalytic RNA molecule. 

15 In more pref ezrred embodiments of the method, the transfer 
is to the 5'-hydroxyl of the catalytic RNA molecule and 
the transfer is to an internal 2' -hydroxy 1 of the 
catalytic RNA molecule. 

In another preferred embodiment of the method, the 

20 catalytic RNA molecule can transfer a phosphate from a 
nucleotide triphosphate to a nucleic acid (preferably, a 
ribonucleic acid) other than the catalytic RNA molecule. 

In another preferred embodiment of the method, the 
catalytic RNA molecules can catalyze N-alkylation, the 

25 catalytic RNA molecule can catalyze N-alkylation of the 
catalytic RNA molecule, and the catalytic RNA molecule 
can catalyze N-alkylation of a nucleic acid other than 
the catalytic RNA molecule. 

In another aspect, the invention features a 

30 catalytic RNA molecule which can transfer a phosphate 
from a nucleotide triphosphate to the catalytic RNA 
molecule. In preferred embodiments, the transfer is to 
the 5'-hydroxyl of the catalytic RNA molecule and the 
transfer is to an internal 2'-hydroxyl of the catalytic 

35 RNA molecule. 
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In another aspect, the Invention features a 
catalytic RNA molecule which can transfer a phosphate 
from a nucleotide triphosphate to a nucleic acid 
(preferably, a ribonucleic acid) other than the catalytic 
5 KNA molecule. 

In another aspect, the invention features a 
catalytic RNA capable of catalyzing N-alJcylation» In 
preferred embodiments, the catalytic RNA molecule can 
catalyze N-alkylation of the catalytic RNA molecule, and 
10 the catalytic RNA molecule can catalyze N-alkylation of a 
nucleic acid other than the catalytic RNA molecule. 

In another aspect, the invention features a method 
for producing a catalytic RNA molecule capable of binding 
a first ligeuid and catalyzing a chemical reaction 
15^ modifying a first substrate molecule bound to the 

catalytic RNA molecule. The method entails the following 
steps: 

a) providing a first population of RNA molecules 
each having a first region of random secpience; 
20, b) contacting the first population with the 

first ligand; 

c) isolating a first ligand-binding 
subpopulation of the first population of RNA molecules by 
partitioning RNA molecules in the first population of RNA 

25 molecules which specifically bind the first ligand from 
those which do not; 

d) amplifying the first ligand binding 
subpopulation in YitEfi; 

e) identifying a first ligand binding sequence; 
30 f ) preparing a second population of RNA 

molecules each of the RNA molecules including the first 
ligand binding sec[uence and a second region of random 
sequence, each of the RNA molecules being bound to the 
first substrate molecule; 
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g) contacting the second population of RNA 
molecules with a second ligand capable of binding the 
first ligand-binding sequence; and 

h) isolating a subpopulation of the catalytic 

5 RNA molecules from the second population of RNA molecules 
by partitioning RNA molecules which eure bound to a 
svibstrate molecule which has been modified in step g) 
from those PNA molecules which are bound to a substrate 
molecule which has not been modified in step.g) • 

10 In a preferred embodiment of this method, the 

second ligand serves as a second substrate for the 
chemical reaction. 

The invention also features ribozymes having 
polynucleotide kinase activity. Such ribozymes have 80%, 

15 preferably 85%, more preferably 95% homology to any of 
classes I - VII polynucleotide kinase ribozymes described 
in FIG. 5. More preferably such ribozymes have 90% (more 
preferably 95%) homology to the core catalytic region of 
any of these classes of ribozymes. The core catalytic 

20 region is the minimal sequence required for catalytic 

activity. This sequence can be determined using standard 
deletion analysis. 

The invention also features ribozymes capable of 
carrying out an alky lat ion reaction. In a preferred 

25 embodiment the ribozyme has 90%, and preferably 95% 
homology to BL-E. 

Other features and advantages of the invention 
will be apparent from the description of the preferred 
embodiments, and from the claims. 

30 Description of the Drawings 

FIG. 1 is a schematic illustration of a minimal 
ATP aptamer (SEQ ID NO: 4). 

FIG. 2 is a schematic illustration of the random 
RNA pool built around the ATP aptamer structure and the 

35 selection scheme (SEQ ID NO: 5) . The pool contained 
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three regions of remdom sequence (N) for a total of lOO 
randomized bases. The aptamer region was mutagenized to 
a level of 15% • The Ban I site used to ligate the two 
halves of the pool is shown in gray. Constant primer 
5 binding sites are shown as thick lines. Random pool RNA 
was allowed to react with ATP-y-S and thiophosphorylated 
molecules were isolated by reaction with thiopyridine- 
activated thiopropyl sepharose. Non-specif ically bound 
molecules were removed by washing under denaturing 
10 conditions. Active molecules were eluted with 2- 
mercaptoethanol. Constant regions: 5'- 

GGAACCUCUAGGUCAUUAAGA-3' {5 '-end constant region) (SEQ ID 
NO: 1); 5'-ACGUCAGAAGGAUCCAAG-3r (3'-end constant region) 
(SEQ ID NO: 2) . 

^IG. 3 is a graph showing the percent RMA eluted 
by 2-mercaptoethanol from the thiopyridine-activated 
thiopropyl Sepharose at each cycle of selection. 
Background sticking and elution from the resin is 
approximately 0.5%. The concentration of ATP-y-s used in 
20 each selection and the incubation time for each selection 
is shown below the graph. Also indicated is whether the 
selection entailed mutagenic PCR. 

FIG. 4 is a graph showing the k^^e Pooi RNA for 
selection cycles 6-10, 12 and 13. Reactions were 
25 performed with 100 fM ATP-y-S, and a time point was 

chosen such that < 20% of the pool had reacted. At cycle 
6, the activity of the pool could be readily detected. 
The following seven cycles increased the activity by 
nearly three orders of magnitude. The drop in k^^e 
30 cycle 8 is presumably due to the effects of mutagenic 
PGR, coupled with the fact that the pool was no longer 
immobilized on streptavidin in this cycle. Cycle 11 
activity declined for unknown reasons. 

FIG. 5 illustrates the sequences of molecules 
35 representing the seven major kinase classes (50 clones 
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sequenced) (SEQ ID NOS: 6-24). Arrows delimit the ATP 
aptamer conserved loop. The Ban I site used for pool 
construction (see FIG. 2) is underlined* Complementarity 
between the random region and the (constant) 5 '-end of 
5 the RNA is shaded (Classes I and V) . Both of these 
classes are 5 '-kinases; these regions may serve to bind 
the 5 '-end in the active site of the ribozymes. Sites of 
2'-thiophosphorylation are shown as white letters in 
black boxes. Clone Kin. 47 is inactive, and contains a G 
0 to A mutation at the site of 2'-thiophosphorylation. The 
sequences of the constant primer binding regions (see 
FIG. 2) are not shown except for the first three bases 
following the 5' primer binding site (AGA) . The length 
of the original pool (not including primer binding sites) 
5 was 138 nucleotides. Point deletions may have occurred 
during the chemical synthesis of the pool DNA, and larger 
deletions may be due to annealing of primers to sites in 
the random regions during reverse-transcription or PCR. 
FIG. 6 is a set of schematic illustrations of 
10 proposed structures of the ATP aptamer consensus and 
several classes of ATP aptamer (SEQ ID NOS: 25-29). In 
the illustration of the consensus aptamer conserved bases 
in the loop are shown in capital letters. Positions that 
tend to be A, but which can vary, are shown as "a'^s. The 
15 bulged G is also conserved, but the stem regions (aside 
from being base paired) and the right hand loop are not. 
For the schematic illustrations of possible secondary 
structures of the ATP aptamer domains of four of the 
major classes of ribozymes, the sequence of the most 
JO active clone is shown in each case. Positions in the 
loop regions that differ from the consensus sequence for 
the ATP aptamer are highlighted in gray. One of the stem 
regions from each of Classes II, VI and VII is missing, 
and so these structures are not shown. 
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FIG. 7A is a schematic illustration of a ribozyme 
capable of tremsf erring a phosphate to its 5' end (SEQ ID 
NO: 30) . FIG. 7B is a schematic of a trans-acting 
ribozyme and a substrate (GGAACCU) . 
5 FIG. 8 A is a strategy for in vitro evolution of 

self -alkylating ribozymes. FIG. 8B is a scheme for 
isolating biot in-binding PNAs by affinity chromatography. 
FIG. 8C is a scheme for isolating self-*biotinylating RNA 
enzymes. FIG. 8D shows coding sequences for RNA pools 

10 used for in vitro selection experiments (SEQ ID NOS: 32- 
34) • Upper case A, C, G, T: pure nucleotide. N: 
eq[uimolar mix of A, C, G, T. Lower case a, c, g, t: 70% 
major nucleotide, 10% each of three minor nucleotides. 
Underline: constant primer sequences used for 

15 amplification. 

FIG. 9A illustrates progress of the biotin aptamer 
selection. Biotin-eluted HNA expressed as a percentage 
of total RNA applied to the biotin-agarose column is 
plotted as a function of selection cycle. Individual 

20 RNAs eluted from the seventh round were subcloned and 
sequenced. Greater than 90% of the clones correspond to 
the sequence shown in FIG. 12. FIG. 9B illustrates 
progress of the self-biotinylation selection. Ligation 
rate determined by incubation with 200 BI£ followed by 

25 streptavidin-agarose purification. Values are corrected 
for 0.02% non-specific RNA binding. 

FIG. lOA is a site-specific alkylation reaction 
catalyzed by BL8-6 ribozyme. 5 '-end labeled BL8-6 RNA 
was allowed to react overnight with 200 aiH BIE and then 

30 separated by streptavidin affinity chromatography into 
biotinylated and non-biotinylated fractions. RNA was 
then treated with sodium borohydride and aniline acetate 
to specifically cleave at N7 alkylation sites. For 
comparison, DMS-treated RNA was treated in parallel. A 

35 single major cleavage site in the biotinylated fraction 
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(corresponding to 6ua-96) Is absent from the non- 
blot inylated RNA. Minor bands present in the non- 
blotinylated fraction appear to result from non-specific 
BNA cleavage as judged by their greater intensity in BL8- 
5 6 RNA subjected to partial alkaline hydrolysis. FIG. lOB 
illustrates the inferred N-alklyation reaction at the N7 
position of G-96. 

FIG. 11 illustrates f \inctional biotin binder and 
biotin ligator sequences (SEQ ID N05: 35-86) . The 

10 partially-randomized pool sequence is shown above each 
set of sequences. Deviations from the principle 
nucleotide at each position sure explicitly written while 
conservation of the wide type base is indicated with a 
dash. Biotin aptamer and self-biotinylating RNA 

15 partially-randomized pools were re-selected for biotin- 
agarose binding and self-blotinylation respectively. 
Biotin aptamer sequences correspond to clones from the 
fourth roxind of re-selection. Self-biotinylating 
ribozyme clones were sequenced after eight rounds of re- 

20 selection, when the overall biotinylation activity of the 
pool was 100 times the activity of the initial BL8-6 
ribozyme. Arrows are used to indicate the locations of 
proposed helices. Boxed nucleotides are highly conserved 
yet not involved in secondary structure. 

25 FIG. 12 A and FIG. 12B illustrate the proposed 

secondary structures for the biotin aptamer and the self- 
biotinylating ribozyme. Nucleotides within the boxed 
region are highly conserved and make up the essential 
core of the apteuaer and ribozyme. Asterisks indicate 

30 pairs of positions that co-vary in a Watson-Crick sense. 
Nucleotides in the constant primer sequences are shown in 
italics. FIG. 12A is a complete sequence of the BB8-5 
biotin aptamer, shown as the proposed pseudoknot. FIG. 
12B (SEQ ID NO: 91) is a sequence and proposed cloverleaf 

35 structure for the BL8-6 self-biotinylating ribozyme. The 
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guanosine residue that serves as the alkylation site for 
the biotinylation reaction is circled. 

FIG. 13A illustrates the sequence (SEQ ID NO: 87) 
of a clone obtained from the partially-randomized 
5 ribozyme pool after re-selecting for biotinylation 

activity (BL2.8-9) was modified to allow folding into an 
idealized clover leaf structure. FIG. 13B illustrates in 
vitro transcribed RNA assayed for self -biotinylation with 
10 fM BIE. Folding was calculated by the LRNA Program 

10 (Zuker, Science 244:48, 1989) 

FIG. 14B shows the results of a ribozyme-mediated 
biotinylation of a separate RNA substrate. The designed 
self-biotinylating ribozyme (FIG. 13 A) was re-engineered 
into two halves, BL-E and BL-S, that could respectively 

15 .serve as the enzyme and substrate for the biotinylation 
reaction. This re-engineered molecule is illustrated in 
FIG 14A (SEQ ID NOS: 88, 89). To assay the two piece 
system, 5 (M radiolabelled BL-S RNA was incubated in the 
- presence of 200 fM BIE and 0 to 500 nM unlabelled BL-E 

20 RNA. RNA biotinylation was determined as described 
herein. The reaction plateaus overnight at a level 
corresponding to one equivalent of product. 

Description of the Preferred Embodiments 

EXAMPLE 1 

25 In one example of the invention, RNA molecules 

which bind ATP were first isolated from a pool of random 
RNA. RNA molecules capable of binding ATP were 
sequenced, and the information obtained was used to 
design a second pool of RNA molecules which included an 

3 0 ATP binding site or variant thereof. This pool was then 
subjected to selection and amplification to identify RNA 
molecules having polynucleotide kinase activity. 
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Selection o f ATP-bindlna RNAs; The selection of ATP- 
binding RNAs was carried out in a manner designed to 
ensure selection of RNAs capable of binding ATP in 
solution as well as on an insoluble support. This was 
5 accomplished by selecting HNA molecules which bound an 
ATP^-sepharose column and could be eluted using ATP. 

A chemically synthesized pool of DNA molecules 
containing a central region of 120 random nucleotides 
flanked by constant regions used as primer binding sites 

10 was PCR-amplified and transcribed in vitro by T7 UNA 
polymerase in the presence of [a-^2p]GRP. RNA was 
ethanol^-precipitated and unincorporated nucleotides 
removed by Sephadex-G50 chromatography* Following a 
brief incubation at 65 in binding buffer (300 mM NaCl, 

15 20 Mm Tris, pH 7.6, 5 mM MgCl2) , the RNA was cooled to 
room temper attire before being loaded onto a 1-ml ATP 
agarose affinity column. The affinity matrix contained 
-1.6 mM ATP linked through its C8 position through a 
diaminohexyl linker to cyanogen bromide-activated agarose 

20 (Sigma, St. Louis, MO). After washing with 6 column- 
volumes of binding buffer, bound RNAs were eluted with 3 
column-volumes of binding buffer containing 4 mM ATP, 
then concentrated by precipitation with ethanol. For the 
first three cycles, an agarose pre-column was used to 

25 prevent enrichment of the RNA pool with agarose-binding 
RNAs, and bound RNAs were eluted with 5 mM EDTA in water 
rather than af f inity-eluted with ATP. After reverse 
transcription and PCR amplification, DNA templates were 
transcribed and the resulting RNA was used in the next 

30 round of selection. RNA from the eighth round of 

selection was converted to cDNA, amplified as double- 
stranded DNA by PCR, digested with EcoRl and BamHl, gel- 
purified and cloned into the phage M13 based vector 
pGem3Z (Promega, Madison, WI) . 
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Thirty-nine clones from the eighth cycle RNA 
population were sequenced seventeen different sequences 
were fo\ind. Of these, the most abundant sequence (C8- 
ATP-3) occurred 14 times, and 12 sequences occurred just 
5 once. Compeorison of the seventeen different sequences 
revealed an 11-nucleotide consensus sequence, of which 
seven positions are invariant suaong all clones but one 
(C8-ATP-15). Clones 2, 3, 8, 15, and 19 were 
individually tested for binding to ATP-agsurose* All had 
10 a dissociation constant (K^) of less than 50 /iM, except 
for C8-ATP-15, for which the estimated was -250 /tM. 

To determine the minimal sequence for ATP binding, 
deletions of C8-ATP-3 were analyzed « An active RNA 
molecule 54 nucleotides in length (ATP-54-1) was 
15 ^ generated by a combination of 5' and 3' deletions. This 
RNA can be folded into a secondary structure in which the 
^ll-base consensus is flanked by two base-paired stems. 
Deletion of the left-hand stem abolished ATP-binding 
: activity; dimethylsulphate modification experiments also 
20 supported the proposed secondary structure. Comparing 
sec[uences of all the clones showed that they all had a 
potential to fold into this secondary structure. This 
analysis also highlighted the presence of an invariant 
unpaired G opposite the 11-base consensus. The 
25 orientation and distance of this G and its flanking 

sequences relative to the consensus sequence was variable 
from clone to clone. The stems flanking the conserved 6 
and the consensus were variable in length and 
composition, and frequently contained G-U base pairs. 
30 The simplest explanation for the observation that all of 
the selected clones contained a single consensus sequence 
embedded in a common secondary structure is that these 
clones contain the shortest sequences capable of binding 
ATP with the necessary affinity, and that all other 
35 sequences with comparable or superior affinity are longer 
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and hence less abundant in the initial random sequence 
pool. 

On the basis of these findings, a smaller RNA of 
40 nucleotides (ATP-40-1) was designed, in which the 
5 consensus sequence was flanked by stems of six base 
pairs, with the right-hand stem closed by a stable loop 
sequence for enhanced stability. This RNA bound ATP as 
well as the full-length 164-nucleotide RNA C8-ATP-3 and 
was used for later experiments. Variant 40- 

10 oligonucleotide RNAs were also synthesized to test the 
importance of the highly conserved unpaired G (residue 
G34 in ATP-40-1) for ATP binding. Deleting this residue 
or changing it to an A residue eliminated binding, 
confirming the results of the selection experiments. 

15 To determine which functional groups on the ATP 

are recognized by the ATP-binding RNA, we examined the 
ability of a series of ATP analogues to elute bound ATP- 
40-A RNA from an ATP-agarose column. Methylation of 
positions 1, 2, 3, or 6 on the adenine base, or the 3' 

20 hydroxyl of the ribose sugar, eQ>olish binding, as does 
removal of the 6-fiunino or 2'_hydroxyl. Positions 7 and 8 
on the base can be modified without effect; this is not 
surprising considering that selection was for binding to 
ATP linked to an agarose matrix through its 08 position. 

25 Adenosine, AMP, and ATP are equally efficient at eluting 
the RNA, suggesting that the 5' position on the ribose 
moiety is not recognized by the RNA. 

Isocratic elution (Arnold et al., 
Chromatography 31:1, 1986) from ATP-agarose and 

30 equilibrium gel filtration (Fersht, in Enzyme Structure 
and Mechanism p. 186-188, Freeman, New York, 1985) was 
used to meastire the dissociation constant for the RNA-ATP 
complex on the column and in solution. The of ATP-40- 
1 was -14 /iM when measured by isocratic elution from an 

35 ATP-C8-agarose column, and 6-8 /iM by equilibrium gel 
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filtration* The for the ATP-agarose complex is an 
upper estimate, because the fraction of bound ATP that is 
accessible to the RNA is not known » The solution for 
adenosine was similar to that of ATP (5*>8 /iH) , but the 
5 for dATP was not measurable (>1 mM) . The of ATP-40-1 
for its ligand dropped to 2 /iM when the Hg^^ concentration 
was raised from 5 to 20 mM. Ch£mging the base pair U18- 
A33 to as found in most of the clones initially 

selected, further decreased the to 0,7 /itN. At almost 

10 satxirating concentrations of ATP (50 ^H) , the RNA boimd 
-^O,? equivalents of ATP* The RNA likely binds its 
ligands with a stoichiometry of \init. 

Kethoxal modification (Moozod et al., J> Mol. 
Biol> 187:399, 1987) was used to assess the accessibility 

15 of guanoslne residues to modification. G7 and G17 within 
the loop, and G6 (which forms the G-C base pair on the 
*lef t side of the loop) , all of which are strongly 
protected in the absence of ATP, become highly accessible 
to modification by this reagent in the presence of ATP. 

20 Other guanosine residues, including G8 in the large loop, 
the single unpaired G opposite the loop, and Gs in the 
stems, are highly protected in the presence or absence of 
ATP. These observations suggest that the motif is highly 
structured both in the presence and absence of ATP, but 

25 that binding induces a conformational change in the 
structure of the RNA* 

A pool of random sequence RNAs, using the above- 
identified minimal ATP aptamer as a core structure was 
prepared and used to create polynucleotide kinase 

30 ribozymes. The ATP aptamer is based on that described by 
Sassanfar and Szostak ( Nature . 364:550,1993). 
Selection of Catalytic RNAs; A pool of RNA molecules for 
selection of catalytic RNAs was created based on a 
minimal ATP aptamer core sequence (FIG. 1) . The ATP 

35 aptauner core was surrounded by three regions of random 
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sequence, 40, 30, and 30 nucleotides in length as shown 
in FIG. 2. The ATP-binding domain itself was mutagenized 
such that each base had a 15% chance of being non-wild 
type, to allow for changes in the aptamer sequence that 

5 might be required for optimal activity. To increase the 
likelihood of finding active molecules, an effort was 
made to create a pool containing as many different 
molecules as possible. Because it is difficult to obtain 
an acceptable yield from the synthesis of a single 

.0 oligonucleotide of this length (174 nucleotides), two 
smaller DNA templates were prepared and linked together 
to generate the full length DNA pool (FIG. 2) (Bartel and 
Szostak, science , 261:1411, 1993). The presence of 
constant primer binding sites at the 5' and 3' ends of 

15 the molecules permitted amplification by PCR. 

Transcription of this DMA pool yielded between 5 x lO^^ 
and 2 x 10^^ different RNA molecules. 

In order to select for catalytic activity, it is 
necessary to tag active molecules so that they can be 

20 separated from inactive ones. To accomplish this, the 
random sequence RNA pool was incubated with ATP-y-S and 
the transfer of the thiophosphate from ATP-y-S to the RNA 
was selected for chromatography on a thiopyridine- 
activated thiopropyl sepharose column, which forms 

25 disulfide bonds with RNAs containing thiophosphate 
groups. Molecules without thiophosphates were washed 
away under denaturing conditions. RNAs linked via a 
disulfide to the column matrix were eluted with an excess 
of 2^mercaptoethanol. This overall scheme is illustrated 

30 in FIG. 2. Briefly, the pool was incubated with ATP-y-S 
under conditions designed to promote the formation of RNA 
tertiary structure (400 mM KCl, 50 mM MgClj, 5mM MnClj, 
25mM HEPES, pH 7.4). Mn^'*' was included because of its 
ability to coordinate phosphorothioates. Streptavidin 

35 agarose immobilization of pool RNA was used during the 
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first seven cycles to prevent pool aggregation* After 
cycle 7, the ATP-yS reaction step was performed in 
solution (1 MM MIA) . For the first cycle, 2.4 mg (40 
nmoles; 5 pool equivalents) of random pool RNA was used^ 
5 in the second cycle 150 (2.4 nmoles) RNA vas used, and 
in succeeding cycles 60 (1 nmole) was used. TbB 
selection stetp was performed by incubating the RNA with 
thiopyridine^activated thiopropyl sepharose-6B 
(Pharmacia, Piscataway, NJ) in 1 mM EDTA, 25 mM HEPES, pH 

10 7.4 for 30 minutes at room temperature. The resin was 
then washed with 20 column volumes each of wash buffer 
(IM NaCl, 5 mM EDTA, 25 mM HEPES, pH 7.4), water, and 
finally 3 M urea, 5 mM EDTA to eliminate molecules 
without thiophosphates. RNAs linked to the resin via a 

15 disulfide were eluted with 0.1 M 2-mercaptoethanol in 
0.5X wash buffer. Reverse-transcription, PCR and 
transcription yielded a new RNA pool enriched in active 
molecules. This process comprised one cycle of 
.selection. 

20 Prior to each cycle of the selection, the pool RNA 

generated by transcription was exhaustively 
dephosphorylated with calf intestinal alkaline 
phosphatase to remove the 5 '-triphosphate, and any other 
phosphates that might have been transferred to the RNA by 

25 dutophosphorylation during transcription. 

The selection protocol demanded only that an RNA 
molecule contain a thiophosphate in order for it to be 
isolated. Reactions that could have been selected for 
include: transfer of the y -thiophosphate from ATP-y-S to 

30 the 5' -hydroxy 1 of the RNA (analogous to the reaction 
catalyzed by T4 polynucleotide kinase), to the 3'-end of 
the RNA, to an internal 2' -hydroxy 1, or even to a group 
on one of the bases. Transfer of diphosphate (or perhaps 
the entire triphosphate) instead of a single 

35 thiophosphate is also possible for all of these 
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reactions. A splicing reaction, in which ATP-y-S 
displaces one of the first few nucleotides of the RNA in 
a manner analogous to the reaction catalyzed by the Group 
I introns, could also occur. However, cleavage of more 
5 than the first few bases of the RNA would result in a 
molecule lacking a 5 '-primer binding site, and such a 
molecule would not be amplified during the PGR step of 
the selection. Similarly, any reaction that blocked 
reverse transcription would not be selected for. 

10 The progress of the selection process was 

monitored by measuring the fraction of the pool RNA that 
bound to the thiopropyl Sepharose and was eluted with 2- 
mercaptoethanol {FIG. 3). Initially, -0.5% of the RNA 
bound nonspecifically to the matrix and was eluted by 2- 

15 mercaptoethanol. After five cycles of selection, greater 
than 20% of the pool RNA reacted with thiopropyl 
Sepharose. Since there were at least 10,000 different 
molecules left in the pool at this stage, the stringency 
of the selection in the succeeding cycles was increased 

20 by lowering the ATP-Y"S concentration and the incubation 
time, in order to try to isolate the most active 
catalysts. 

Qntimizatio n Catalytic RNAs: Because the random pool 
initially prepared sampled sequence space very sparsely 

25 (there are between 4^^° and 10^° possible lOO-mers, but 
only approximately 10^^ different molecules in the pool) , 
active molecules are likely to be sub-optimal catalysts. 
Accordingly, three cycles of mutagenic PGR (before 
selection cycles 7, 8, and 9) were performed to allow the 

30 evolution of improvements in the active molecules. 

Mutagenic PGR was performed as described by Bartel and 
Szostak ( Science . 261:1411, 1993) and by Gadwell and 
Joyce ( PGR Methods ApdI. . 2:28, 1992)- Briefly, thirty 
total cycles of PGR were done at each round to yield - 2% 

35 mutagenesis. Reactions of pool RNAs were performed 



wo 96/06944 



PCTAJS9S/10813 



- 27 - 

either with trace ATP-y-^^S, or with 100 /xM ATP-y-S plus 
additional trace ATP-y-^^S. Dithiothreitol (DTT, lo mM) 
was included in the reactions » Reactions were quenched 
by the addition of one volime of 150 nM EDTA, 20 mM DTT 
5 in 95% formamide. Reactions were analyzed by 

electrophoresis on 10% polyacrylcuziide/S M urea gels. 
Quantitation was performed using a Phosphorlmager 
(Molecular Dynamics) . A known amount of ATP-y-^^S was 
spotted on the gels as a standard. The combined effect 

10 of increasing the stringency and performing mutagenic PGR 
was to increase the activity of the pool by nearly three 
orders of magnitude from cycle 6 to cycle 13 (FIG. 4) . 
Catalytic RNAs Identified; After 13 cycles of selection, 
RNA molecules from the pool were cloned using the pT7 

15 Blue T-Vector kit by Novagen, and 50 clones were 

secjuenced. The clones sequenced (FIG. 5) fall into seven 
classes of two or more closely related molecules (19 
clones) and 31 unique sequences. Each class of secpiences 
represents molecules with a common ancestor that acquired 

20 mutations during the course of the mutagenic PGR done in 
cycles 7-9 of the selection. 

Comparison of the sequences in the seven major 
classes of molecules reveals significant conservation of 
the sequence of the original ATP binding site in some of 

25 the active RNAs. FIG. 6 shows the putative structures 
for the ATP aptamer regions from Classes I, III, iv and 
v, the classes for which an aptamer-like structure can be 
dra%m. It appears that Classes I and III have changed 
significantly from the original ATP binding domain, 

30 whereas Classes IV and V are only slightly different from 
the ATP aptamer consensus sequence described by Sassanf ar 
and Szostak ( Nature . 364:550, 1993). Either the right or 
left hand stems of the Class II, VI and VII aptamer 
regions appear to be missing, and it seems likely that 

35 these molecules have found novel modes of binding their 



wo 96/06944 



PCT/US95/10813 



- 28 - 

substrates ♦ Using run-off transcription of synthetic DNA 
oligonucleotides (Hilligan and Uhlenbeck, M^thO^^ 
Enzymol, 180:51, 1989) the RNAs corresponding to the 
Class I, III, IV, and V aptamer regions were produced. 
5 The Class IV aptamer RNA binds weakly to C-8 linked ATP 
agarose (Sassanf ar and Srostak, wpyft) t consistent with a 
molecule having a for ATP in the range of 0.05-0 ,5 mM, 
The Class I, III, and IV apt€uaers, on the other hand, do 
not detectably interact with ATP agarose, consistent with 
10 K^s > 0.5 mM for ATP (if they bind ATP at all) • 

Presumably, the cojrresponding classes of kinases have 
developed novel modes of binding ATP-Y'S. 

Characterization of the Cat alyzed Reactions: Pool 13 RNA 
and the members of each of the major classes of kinases 
15 were tested to determine what reactions they catalyze. 
Nuclease PI analysis was performed as follows. RNA (l^M) 
was allowed to react with -l^M ATP-y-^^S in reaction 
buffer for 4-18 hours. The RNA was the separated from 
nucleotides by G-50 spin column gel filtration 
20 (Boehringer-Mannheim, Indianapolis, IN). The RNA was 

digested with nuclease PI as described in Westaway et al. 
ils. Biol, chem. 268:2435, 1993) and Konarska et al. 
( Nature 293:112, 1981). An aliquot was then spotted 
directly onto a PEI cellulose TLC plate (Baker, 
25 Phillipsburg, NJ) and developed in IM LiCl, lOmM DTT (as 
described in Westaway, sHEXa) • The products were 
localized by UV shadowing (for unlabelled GMPaS) or 
autoradiography. Thiophosphate containing nucleotides 
run slower in this system than do the corresponding 
30 phospho-nucleotides, presumably because there is weaker 
interaction between Li* and the thiophosphate than there 
is with the phosphate. 

PEI cellulose thin layer chromatography (TLC) of 
nuclease PI digests of auto-thiophosphorylated RNA shows 
35 two major radiolabeled products, demonstrating that at 
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least two different reactions are catalyzed by the pool 
13 RNAs. If a particular RNA molecule transfers the y 
thiophosphate from ATP-y^S to its own 5'-hydroxyl, the 
nuclease PI digestion should yield labeled GHPaS, since 
5 all of the RNAs begin with guanosine. All members of 
Classes I, II, III, v, and VI yield GHPaS as the sole 
nuclease Pi digestion product, indicating that they are 
5 '-kinases. Classes IV and VII, on the other hand, yield 
a nuclease Pi digestion product that does not migrate 

10 from the origin in the TLC system used. Both RNase T2, 
which hydrolyzes RNA to nucleotide 3 '-monophosphates, and 
nuclease Pi digestion of reacted Class IV and Class VII 
RNAs, give products that run as molecules with charges of 
-5 to -6 on DEAE cellulose TLC plates, using a solvent 

15 system that separates based upon the charge of the RNA 
fragment (Dondey and Gross, Anal. Blochem, 98:346, 1979; 
Konarska et al.. Nature 293:112, 1981). These data are 
consistent with Class IV and VII RNAs being internal 2'- 
kinases, since neither nuclease PI nor RNase T2 can 

20 cleave at 2'-phosphorylated sites (Westaway et al., 

J. Biol. Chem. 268:2435, 1993). The products of these 
digestions, then, should be ^^S-labeled dinucleotides with 
5 ' -phosphates or 3 ' -phosphates (for nuclease PI and RNase 
T2 digestions, respectively) and 2 '-mono- or di- 

25 phosphates. 

Experiments in which the RNAs were allowed to 
react with unlabeled ATP-y-S and were then purified and 
reacted with ATP-y-^^P and T4 polynucleotide kinase 
support the proposal that Classes I, II, III, v, and VI 

30 are 5 '-kinases, and that Classes IV and VII phosphorylate 
some internal site. As expected, reaction products from 
Classes I, II, III, v, and VI cannot be labeled by T4 
polynucleotide kinase, consistent with their being 5'- 
kinases. Class IV and VII RNAs, on the other hand, are 

35 efficiently labeled by T4 polynucleotide kinase after 
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they have been allowed to react with ATP-y-S. 
Furthermore, this labeled RNA can be purified on a 
thiopyridine-activated thiopropyl sepharose column, 
demonstrating that the thiophosphate label is not lost 
5 during the reaction with ATP and T4 polynucleotide 
kinase. Thus, the Class IV and VII kinases do not 
catalyze reactions involving their 5'-hydroxyls. 

conclusive evidence for the 2 '-kinase hypothesis 
is provided by partial alkaline hydrolysis of the auto- 
10 thiophosphorylated, S'-^Zp-iabeled RMA. For this 

analysis, RNA was reacted with ATP-y-S as described above 
for TLC analysis, except that 100 iM unlabeled ATP-y-S 
was used. The thiophosphorylated RNAs were purified on 
thiopyridine-activated thiopropyl sepharose, and then 5'- 
15 end labeled using T4 polynucleotide kinase and ATP-y-^^P. 
Alkaline hydrolysis was performed in 50 nM sodium 
carbonate/bicarbonate buffer, pH 9.0, 0.1 mM EDTA for 3 
min. at 90"C. Reaction products were analyzed on an 8% 
polyacrylamide/8 M urea gel. 
20 For RKAs from both Classes IV and VII, a gap is 

seen in the alkaline hydrolysis ladder of the auto- 
thiophosphorylated material that is not present in the 
ladder made with unreacted RMA. The missing bands can be 
most easily explained if the 2'-hydroxyls at these 
25 positions are thiophosphorylated, thus preventing base- 
catalyzed RNA hydrolysis. This experiment permitted 
identification of positions of thiophosphorylationi G62 
in Kin. 10 (Class IV) and G83 in Kin. 62 (Class VII). G62 
is in a putative helix within the ATP aptamer region of 
30 Kin. 10, and G83 is in the random loop between the two 
halves of Kin. 62 's aptamer domain. 

y^n«>»ie Anal yf^s of Kinase RibQZVmes: Kinetic analysis 
of the most active clone from each of the four major 
classes of kinases has revealed that they all obey the 
35 standard Michaelis-Menten kinetics expected of molecules 
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possessing saturable substrate binding sites. Rates for 
each clone were detenained (as described herein) at 6 
different ATP-a-S concentrations, ranging from 2 mM - 2.5 
MM. Values of k^^^ and are shown in Table 1, and range 
5 between 0.03 and 0.37 min"^ and between 41 and 456 mM, 
respectively. 





TABLE 


1 






Kinase Class fClone> 




Km (MM) 


10 


Class I (Kin. 46) 


0.37+0.01 
0.23+0.02 
0.36+0.02 


456+57 
116+41 
352+85 




Class II (Kin. 25) 


0.20+0.02 
0.33+0.02 


41±15 
42+11 


15 


Class III (Kin. 42) 


0.07+0.005 
0.10+0.016 


50+13 
58±28 




Class IV (Kin. 44) 


0.03+0.001 
0.03+0.001 


276+25 
200+22 



The y^c&t Class I-IV ribozymes compares favorably with 
corresponding values for naturally occurring ribozymes, 

20 which range from 0.04 to 2 min'^. Comparison of k^^^/K^ 
is difficult because most natiiral ribozymes have 
oligonucleotide substrates that form base pairs with the 
ribozyme's substrate binding site, leading to very low K„ 
values. A comparison between the kinase ribozymes 

25 described here and the self-cleavage reaction catalyzed 
by the Tetrahymena Group I intron is particularly 
relevant, however, because both reactions use external 
small molecule substrates (ATP-y-S and guanosine 
nucleotides, respectively) to modify themselves. Kin. 25 

30 (Class II) phosphorylates itself with a k^j^t 

approximately 0.3 min"^ and a i^cBt/^ of 6 x lo^ min*^ M"^. 
The Tetrahymena self -splicing intron has a k^at of 0.5 
min"^ and a i^cat/K of 2.5 x lo^ min"^ M"^ (Bass and Cech, 
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Nature 308:820, 1984). Thus, from a vanishingly small 
sampling of sequence space, it has been possible to 
isolate a molecule with autocatalytic activity 
essentially as good as that of a ribozyme found in 
5 nature. 

Class I-IV kinases show specificity for ATP-y-S as 
a substrate. No reaction (<0.1% ATP-y-S rate) could be 
detected with GTPyS, indicating that the RNAs can 
discriminate between similar substrates. Interestingly, 
10 as much as 30% of the cycle 13 pool RNA can use CTP-y-S 
as a substrate, and thus pool 13 does contain molecules 
with less stringent substrate specificities. The Class 
I-IV kinases are also able to discriminate between ATP-y- 
S and ATP (kobfl(ATP-y-S)/kobB (ATP): Class I = 55; Class 
15 II « 300; Class III « 150; Class IV ^ 300; 100 MM ATP, 
ATP-Y-S) . Since these values are significantly larger 
than the three to ten fold intrinsic reactivity 
difference between ATP-y-S and ATP (Herschlag et al.. 
Biochemistry 30:4844, 1991), the data suggest that the 
20 thiophosphate is important for binding, catalysis or 

both. Furthermore, pool 13 RNA is not detectably labeled 
by either ATP-a-^^S or ATP-a-^^P, suggesting that 5' 
splicing is not a reaction that occurs in the pool 
(unless the y-thiophosphate is an absolute requirement 
25 for the molecules that carry out this reaction). 

Rate Acceleration : The uncatalyzed background reaction 
for the thiophosphorylation of RNA (or guanosine) by ATP- 
y-S was not detectable. Based on the sensitivity of 
these experiments, the lower limit for the rate 
30 acceleration of the kinase ribozymes is roughly 10^-fold. 
At 70 •C the rate of hydrolysis of ATP in the presence of 
Mg2* is -4 X lO*-* min"^ (pH 6-8) . Correcting for the 
temperature and 55 M water, this value gives a second 
order rate constant of approximately 1 x lo"^ min"^ M"^. 
35 ATP-y-S should hydro lyze 3-10 times faster than ATP. 
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Taking this factor into account, the approximate rate 
enhancement of the present ribozymes 

tJ^cat/^J/fJChydrolysis3' would be 6 x 10^ min"^ fT^Z-lC^ 
min"^ MT^ or 10® - 10^ fold. This enhancement corresponds 
5 to an effective molarity of lo* - 10^ M for ATP in the 
ATP-ribozyme complex (^cat /^hydrolysis " ^-^ min-^/lO-^ nin^ 
IT^) • A comparison of first-order rate constants gives a 
value for the rate enhancement that is independent of 
substrate binding. This value is approximately 10^ fold 
(*^cat/J^hydroly8i8 (1* order) « 0.3 min^/^A x lo"* min"^) . 
This analysis assumes that the mechanism of hydrolysis of 
ATP-Y-S (dissociative) is the same as that used by the 
kinase ribozymes. 

Xntermolecular Catalysis and Turnover: At least one of 

15 the selected kinases is capable of catalyzing the 
phosphorylation of a separate RNA substrate. In 
particular. Kin. 46 (Class I) was demonstrated to transfer 
the y-thiophosphate from ATP-y-S to the 5 '-end of a 6-mer 
-oligoribonucleotide with the same sequence as the 5 '-end 

20 of the ribozyme. To carry out this experiment, RNA was 
incubated as described in FIG. 2 except that 2.5 mM ATP- 
Y-S was used, and 100 5'-HO-GGAACC-3' RNA was added. 
The 6-mer was synthesized by run-off transcription 
(Milligan et al., Metht Enzvmol, 180:51, 1989) and was 

25 dephosphorylated with calf intestinal alkaline 

phosphatase prior to ion-exchange HPLC purification. The 
thiophosphorylated 6-mer marker was made by end-labelling 
5'-GGAACC-3' with ATP-y-^^S using T4 polynucleotide 
kinase. Products were analyzed on 20% acrylamide/8 M 

30 urea gels. Full-length Kin. 46 was found to catalyze the 
reaction approximately 500-fold more slowly than the 
autocatalytic reaction. Part of the reason for the 
decreased activity is likely to be competition for the 
active site between the 5 '-end of the RNA and the 

35 exogenous 6-mer substrate. When the 5- 'constant region 



wo 96/06944 



PCT/US95/10813 



- 34 - 

Of the RNA is removed (via PGR with an internal 5'- 
primer, followed by transcription), the activity 
increases -100-fold, but is still 6 fold below that of 
the auto-thiophosphorylation reaction. (At satxirating 
5 concentrations of 6-mer (100 mM) and ATP-y-S (2.5 mM) the 
initial rate of thiophosphorylation is 0.05 /xM/min with 1 
/iH ribozyme. In comparison, the rate of auto- 
thiophosphorylation for full length Kin. 4 6 RNA (1 MM) 
with 2.5 mM ATP-Y-S is 0.3 ^M/min.) At 25*>C the ribozyme 

10 performs approximately 60 turnovers in 24 hours, and is 
thus acting as a true enzyme. The cause of the lower 
trans activity relative to the autocatalytic activity 
remains unknown, but could involve slow stibstrate binding 
or improper folding of the shortened ribozyme. The off 

15 rate of the 6-mer is not limiting because no burst phase 
is observed in a time course of the reaction. 

The identification of autocatalytic ribozymes 
capable of carrying out catalysis in trans, i.e., 
catalyzing a reaction involving the ligand and a molecule 

20 other than the ribozymes itself can be found by testing 
the ability of the ribozyme to act on a molecule having a 
sequence similar to the region of the ribozyme which is 
modified. 

FIG. 7A illustrates an example of a cis-acting 
25 ribozyme with polynucleotide kinase activity. A ribozyme 
capable of carrying out this catalysis in trans can be 
made by eliminating the 5' end of the ribozyme which 
would otherwise base pair with the 3' end of the ribozyme 
and be kinased. The particular molecule shown in FIG. 7B 
30 is derived from the moleucle illustrated in FIG. 7A and 
transfers phosphate to the 5' end of the short 
oligoribonucleotide GGAACCU. 
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SnMPLB 2 

In a second example of the invention, RNAs which 
bind blotln were first created. Identified, and Isolated 
using a randomized RNA pool. The selected RNAs were used 
5 to prepare a second pool of partially randomized RNAs. 
This pool was then subjected to selection and 
amplification to identify RNAs capable of ligating 
biotin. The overall scheme is illustrated in FIGS. 8A, 
8B, and 8C. 

10 Selection of biotin>bindina RWAss A pool of 

approximately 5 x 10^^ different random sequence RNAs was 
generated by in vitro transcription of a DNA template 
containing a central 72-nucleotide random sequence 
region, flanked at both ends by 20-nucleotide constant 

15 . regions. This pool (random N72 pool) had the following 
sequence : GGAACACTATCCGACTGGCA (N) yjCCTTGGTCATTAGGATCG 
; (SEQ ID NO: 3) (FIG. 8D, also SEQ ID NO: 32) . On 
average, any given 28 nucleotide sequence has a 50% 
^probability of being represented in a pool of this 

20 complexity. The initial pool of RNA (approximately 80 
^g; on average, 2-3 copies of each sequence) was 
resuspended in a binding buffer containing lOO mM KCl, 5 
mM MgCljr and 10 mM Na-HEPES, pH 7.4, conditions chosen 
to favor RNA folding and to mimic physiological 

25 environments while minimizing non-specific aggregation. 
The solution was applied to an agarose column derivatized 
with 2-6 mM biotin (Sigma, St. Louis, MO) and 
subsequently washed with 15 colximn volumes of binding 
buffer. Specifically-bound RNAs were then eluted by 

30 washing the coliamn with binding buffer containing 5 mM 
biotin. Ten ^g of glycogen and 0.3 M NaCl were then 
added to the eluted material, and the RNA was amplified 
as follows. Briefly, the mixture was precipitated with 
2.5 volumes of ethanol at -78*>c. After resuspending the 

35 selected RNA, the reverse transcriptase primer (2.5 /iM) ^ 
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was annealed at 65 *C for 3 min. , and reverse 
transcription (HT) was carried out at 42 for 45 min. 
(using Superscript RT enzyme, Life Technologies, Inc.)* 
PGR amplification was performed by diluting one-fifth of 
5 the RT reaction with the appropriate dNTPs, PGR buffer, 
USB Tag polymerase (United States Biochemical, Cleveland, 
OH), and 0.5 /xN (+) primer containing the T7 RNA 
polymerase promoter* A strong band of the correct size 
was typically observed after 8-15 cycles amplification 

10 (94«C, 1 minute; 55*'C, 45 seconds; 72«C, 1 minute). Half 
of the PCR reaction was used for in vitro transcription 
with T7 RNA polymerase (37*C, overnight). The resulting 
RNA was purified by electrophoresis on an 8% 
polyacrylamide gel. 

15 After six rounds of repeated enrichment, more than 

half of the RNA applied to the biotin column was retained 
during the buffer wash, but eluted during the biotin wash 
(FIG. 9A) . The RNA pool from the eighth round of 
selection was cloned into the pCR vector using the TA 

20 cloning kit (In Vitro-Gen, Inc., San Diego, OA), and 
individual aptamers were sequenced by the Sanger 
dideoxynucleotide method using the universal H13 primer. 
A single sequence (represented by clone BB8-5) accounted 
for >90% of the selected pool (two minor clones account 

25 for the vast majority of remaining RNAs) . 

Previous RNA selections for binding to small 
ligands, including various dyes, amino acids, cof actors, 
and nucleotides, have suggested that aptamers exist at a 
frequency of 10"^^ to 10"^^ in random sequence pools. All 

30 of these ligands, however, have contained aromatic rings 
which could intercalate between RNA bases and/or charged 
groups which might interact electrostatically with the 
RNA backbone. The lower frequency of biotin bindings 
(10"^^) shows that ligands lacking such groups may require 

35 a more complex binding site. 
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Selection for biotin-utlli zina rlbozvmes ; The sequence 
of the biotin aptamer was used to direct the synthesis of 
a second pool of RNAs which was screened for the presence 
of biotin-'Utilizing ribozyxaes (FIG. 8A) . This pool 
5 contained a core of 93 nucleotides (71 nucleotides 
derived from the original random region plus its 22 
nucleotide 5' constant region; FIG, 8D) with the wild- 
type nucleotide (i.e., that which was found in the 
original biotin aptamer incorporated at each position in 
10 the template with 70% probability (the three non-native 
nucleotides each occurring with 10% probability) . 
Deletion analysis indicated that the 3' primer was not 
required for binding and the same sequence was therefore 
used for the 3' primer of the partially-randomized pool. 
15 To allow for the possibility that the 5' primer formed 
part of the aptamer core, the original 5' primer sequence 
was included in the partially-randomized region of the 
new pool and a different 5' primer was appended for 
amplification. Because of differences in the relative 
20 rates of phosphoramidite incorporation during DNA 
synthesis, a biased mix of all four nucleotides was 
prepared with molor ratios of 3:3:2:2 (A:C:G:T). This 
mix was added to pure phosphoramidite stocks (A and C: 
64% pure stock, 36% random mix; G and T: 55% pure stock, 
25 45% random mix) to yield mixed stocks for pool synthesis. 

Twelve random bases were added to either end of 
this core sequence and new constant primers for PGR 
amplification were included. The synthesis of this 156 
nucleotide DNA sequence yielded a pool containing 8 x 10^^ 
30 different molecules, which were transcribed to yield a 
pool of RNA molecules clustered in sequence space around 
the original biotin aptamer sequence. The total yield 
from the DNA synthesis was approximately 77 (1.52 
nmole) . The quality of the synthetic DNA was determined 
35 by a primer extension assay, which showed that only 8.7% 
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of the DNA molecules could serve as full length templates 
for Taq polymerase ♦ The pool thus contains 1.52 x 10*^ x 
6.02 X 10^^ X 0.087 » 8 X 10^^ distinct sequences. 

This second RNA pool was used to identify 
5 ribozymes able to enhance the rate of self-alkylation 
with the haloacetyl derivative, N-biotinoyl-N'- 
iodoacetyl-ethylenediamine (BIE; Molecular Probes, 
Eugene, OR) . BIB is normally used to biotinylate 
proteins by reaction with free cysteine sulfhydryls. To 

10 provide one potential internal substrate for the 

alky lat ion reaction, the doped pool was transcribed in 
the presence of exceses 8-mercaptoguanosine, thus yielding 
RNAs containing a single free thiol in the 5 '-terminal 
nucleotide. Following an overnight (15 hour) room 

15 temperature incubation with 200 /xM BIE, RNAs that had 
undergone the self -biotinylation reaction were isolated 
by streptavidin agarose chromatography. 

In particular, reaction with BIE was terminated by 
the addition of 100 jM jS-mercaptoethanol , 5 mM EDTA, 0.3 

20 M NaCl, 50 nq tRNA (fi^ coli . RNAse-free, Boehringer- 
Mannheim, Indianapolis, IN) . After five minutes, the 
mixture was precipitated with 2.5 volumes ethanol on dry 
ice. After washing and resuspension, the RNA was applied 
to 0.5 ml of a 50% slurry of streptavidin agarose in wash 

25 buffer (1 M NaCl, 10 xaM NaHepes, pH 7.4, 5 mM EDTA) that 
had been washed with 50 /ig tRNA. After rocking 30 
minutes to allow streptavidin-biotin binding, the mixture 
was transferred to a 10 ml-column and washed with 4 x 10 
ml wash buffer and 2 x 10 ml distilled water. 

30 RNA bound to streptavidin could be affinity eluted 

by first saturating the free biotin-binding sites with 
excess biotin and then heating in the presence of 10 mM 
biotin at 94 *C for 8 minutes. Amplification of the 
resultant molecules (by reverse transcription, PGR, and 

35 transcription) yielded a pool enriched for catalysts. 
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After three rounds of selection, an Increase in 
the proportion of RNAs binding to the streptavidin was 
observed (PIG. 9B) • By the fifth round, 10% of the RNA 
ligated the biotin substrate. To select for the most 
5 active catalysts, the incubation time was progressively 
shortened from 15 hours to 30 minutes to 1 minute. After 
eight rounds of selection, no further increase in 
activity was observed suggesting that the complexity of 
the starting pool had been exhausted. Sec[uencing 

10 individual clones from the selected pool showed that 50% 
of the ribozymes were very closely related and were 
derived from a single progenitor. One of these clones, 
BL8-6, catalyzes self-biotinylation at a rate of 0.001 
min"^ in the presence of 200 mH BIE. 

15 The rate of self-biotinylation was determined by a 

time course experiment. ^^P-labelled RNA was first 
resuspended in incubation buffer (100 mM KCl, 10 mM Na- 
Hepes, pH 7.4, 5 mM MgClj) and allowed to equilibrate for 
10 minutes at room temperature. 200 /xM BIE was added to 

20 the mixture and aliguots were subsequently removed after 
0 to 120 minutes of incubation. Samples were quenched 
and affinity purified as described in Haugland, Molecular 
Probes Handbook of Fluoprescent Probes and Research 
Chemicals. Aliguots were counted in a scintillation 

25 coxinter following ethanol precipitation (total RNA count) 
and following binding to streptavidin agarose (product 
RNA count) ; the ratio of these two counts is the fraction 
reacted. 

Optimizing enzvmatic activity! it seemed likely that the 
30 original RNA pool from which the BL8-6 ribozyme was 

derived might not saturate the space of biotin-ligating 
ribozymes. To test the possibility that appropriate 
additional mutations to the BL8-6 sequence might increase 
its catalytic activity, a third RNA pool was generated 
35 based on its sequence but with non-wild-type nucleotides 
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substituted at each position with 30% probability (FIG. 
8D) (using methods descxibed above) . The selection for 
catalytic activity was repeated as described above, but 
with both the reaction incubation time and the BIE 
5 concentration progressively lowered to select for the 
most active enzymes. After eight rounds of selection 
(ending with a 1 minute incubation period at 10 MM BIE) , 
active clones from the pool were sequenced and assayed 
for catalytic activity. Ribozymes in this pool were 

10 uniformly more active than their BL8-6 progenitor, with 
one clone (BL2. 8 -7) catalyzing self-biotinylation at a 
rate of 0.05 min"^ in the presence of 100 mM BIE (one 
hundred fold more active than BL8-6) . 
Nature of the reaction product ; The observation that 

15 BL8-6 ribozyme transcribed without 8-mercaptoguanosine 
catalyzed the self-biotinylation reaction as efficiently 
as the thiol-containing SNA indicated that some site 
other than the free thiol in the 8-mercaptoguemosine base 
at the 5'*end of the RNA might serve as nucleophiles for 

20 the alky lat ion reaction. However, the observation that 
BL8*6 ribozyme transcribed without 8-mercaptoguanosine 
catalyzed the self-biotinylation reaction as efficiently 
as the thiol-containign RNA indicated that some other 
site was being alkylated. To identify the reactive site, 

25 5 '-end labelled BL8-6 ribozyme that had reacted with BIE 
was subjected to alkaline hydrolysis, and the resultant 
ladder of molecules was affinity purified on streptavidln 
agarose. In particular, RNA was partially hydrolyzed by 
heating to 90^ C for 7 minutes in the presence of 100 mM 

30 NaHCOj, pH 9.0 and subsequently ethanol precipitated. 
After resuspending in wash buffer, biotin-labelled RNA 
was affinity purified as described by Haugland r supra ) . 
Purified non-biotinylated RNA was obtained from the 
initial f lowthrough fraction from the streptavidin 

35 agarose slurry (prior to washing) . Full length RNAs and 
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those with the approximately 60 3 '-terminal nucleotides 
deleted were retained by the streptavidin %rtiereas shorter 
molecules were not bound. This result maps the biotin 
attachment site to the region • . . 5 ' -^^qqj^qq^j^ioo.j / ^ ^ 
5 Alkylation at the K7 position of purines leads to RNA 
strand scission following treatment with sodium 
borohydride followed by aniline acetate (this reaction 
serves as the basis for the RNA chemical sequencing) 
(Peattie, PrffPt Matl^ Acad> Sci, ag& 76:1760, 1979). RNA 
10 incubated with BIE, purified on streptavidin-agarose, and 
treated in this manner was cleaved at G^^ (. .GGACGUAAA. • ) 
(PIG. lOA). Briefly, RNA was dissolved in 1.0 M Tris- 
HCl, pH 8.2 and 0.2 M NaBH4. Following a 30 min. 
incubation, the reaction was quenched with 0.6 M sodium 
15 acetate/0.6 M acetic acid, pH 4.5, containing carrier 
tRNA. Following precipitation and rinsing, the RNA was 
treated with 1.0 M aniline/acetate, pH 4.5 at 60*C for 20 
min. No G^^-specific cleavage was observed for RNA that 
had been exposed to BIE but not biotinylated (i.e. the 
20 streptavidin flowthrough fraction). 6^^ is therefore the 
alkylation site for the ribozyme. 

To further characterize the alkylation product, 
the BL8-6 ribozyme was transcribed with [a-^^P]-GTP, thus 
labelling phosphates attached to the 5'-hydroxyl of all 
25 guanosines in the RNA. Following reaction with BIE, 
biotinylated RNA was streptavidin-purif ied and 
subsequently digested to 5 ' -monophosphate nucleotides 
with snake venom phosphodiesterase I. Labelled RNA was 
diluted with 25 ixL 10 mM NaCl, 10 mM MgClj, 10 mM Tris- 
30 CI, pH 7.4, and 5 phosphodiesterase I (Boehringer- 
Mannheim, Indianapolis, IN) and incubated for 20 hrs at 
37 •c. Thin layer ion exchange chromatography was carried 
out by spotting plates pre-run with water to remove 
excess salts and then developed with 6 M formic acid. 
35 The PEI cellulose plates (j.T. Baker Co., Phillipsburg, 
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NJ) indicated the presence of a radioactive species in 
the streptavidin-purified RNA that was absent from the 
streptavidin-flowthrough RNA. This adduct migrated more 
rapidly than 5'-GMP in this TLC system, and co-migrated 
5 with 7-methyl 6MP, suggesting that the adduct carries a 
positive charge, consistent with alkylation at N7 (FIG. 
lOA and FIG. lOB) . Although the possibility of 
alkylation at Ml or N3 cannot be ruled out, alkylation at 
either of these sites would not be expected to lead to 
10 strand cleavage following aniline treatment, but would be 
expected to disinipt reverse transcription, thus 
preventing catalysts using these nucleophiles from being 
enriched during the in vitro selection procedure. Taken 
together, these results strongly suggest that N7 of g'* is 
15 the alkylation site. 

Thft eatalvaed rate *>nhaneement t The background rate of 
guanosine alkylation by BIE was determined by two 
independent methods. First, radiolabelled random 
sequence RNA (from the pool used to isolate the original 
20 biotin binder) was incubated for 24 hours with or without 
200 MM BIE. The specific increase in the fraction bound 
by streptavidin agarose (0.15%) after extensive washing 
was taken as a measure of the background reaction. 
Assuming an average of 28 guanosines/RNA sequence, this 
25 fraction corresponds to a non-catalyzed alkylation rate 
of 2.3 x 10"^ s'^ M"^. In a similar approach, low 
concentrations of [a-^^P]-GTP were incubated overnight in 
the presence or absence of 200 iM BIE and after 12 hours, 
affinity purified by streptavidin agarose. The fraction 
30 specifically bound (3.4 x lO'^) indicates a non-catalyzed 
rate of 2.3 x 10~* a'^WT^, in close agreement with that 
obtained from the RNA labelling experiment. A time 
course experiment with BL2.8-7 RNA yields a catalyzed 
biotinylation rate of approximately 8s"^M"^. The ribozyme 
35 rate enhancement is thus approximately 3 x 10^, 
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comparable to that of the most active catalytic 
antibodies although substantially less than that of many 
natural protein enzymes (Treuaontano et al., 2^ Am> chem. 
SQSLm. 110:2282, 1988; Janda et al., lbld> 112:1275, 1990). 
5 Structural differences between the biottn binder and th^ 
biotin liaator; Given that the biotln ligator arose by 
mutagenesis of the biotln binder sequence and that both 
molecules interact specifically with biotin, we expected 
to find significant structural similarities between the 

10 two RNAs. Simple compeurison of their primeury sequences, 
however, failed to Identify a well-conserved domain that 
might play a fxmctional role; mutations appear randomly 
distributed along the length of the two secpiences. To 
characterize the functional cores of the two molecules, 

15 we analyzed the sequences of active clones isolated from 
the two mutagenlzed RNA pools generated from the biotin 
aptamer and self -alkylating ribozyme sequences. After 
four rounds of reselection with the biotln aptamer- 
derived pool, >40% of the applied RNA bound tightly to 

20 biotln agarose. Similarly, three rounds of re-selection 
of the self-alky latlng rlbozyme-derived pool yielded a 
collection of RNAs with activity matching that of the 
original BL8-6 clone, and five additional roimds of 
selection Increased the activity -100-fold. 

25 Approximately thirty individual RNAs from each of these 
subcloned pools were sequenced and analyzed to determine 
which nucleotide positions were conserved and which pairs 
of nucleotides covarled to maintain Watson-Crick base 
pairing. The results of these experiments are summarized 

30 below and in FIG. 11, PIG. 12A, and FIG. 12B. 

Two regions of the biotln binder are very highly 
conserved in clones that retain binding activity (FIG. 
11). Mutations at the 5' and 3' ends of the first 
conserved domain (changing the A^^.g''^ pair to either C:G 

35 or A:T) suggest a hairpin structure stabilized by a 4- 
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base-pair Watson-Crick duplex. Seven non-paired bases in 
the middle of the first domain directly complement the 
3 /-terminal half of the second conserved domain, thus 
suggesting a pseudoknot structure (FIG. 12A) . In that 
5 the bases in these conserved domains are essentially 
invariant, the sequence data provide no covariational 
evidence for the pseudoknot. To test the proposed 
structure, a series of site-directed mutants was 
generated and assayed for binding to biotin agarose. 
10 Single-base substitutions that disrupt proposed Watson- 
Crick base paif s in the pseudoknot completely abolish 
biotin binding while compensatory second site mutations 
that introduce non-native Watson-Crick base pairs are 
able to largely restore biotin binding. These data 
15 strongly support the proposed pseudoknot model for the 
biotin aptamer. 

Compeurison of the sequences of active ribozymes 
from the BL8-6 re-selection indicate a striking change in 
structure relative to the original biotin binder. 
20 Nucleotides involved in the pseudoknot base-pairing (53- 
70, 101^107), virtually invariant in the biotin binders, 
are poorly conserved in the enzyme sequences (FIG. 11) . 
In contrast, the ribozyme sequence in the region 
corresponding to the variable connecting loop of the 
25 biotin binder (nucleotides 71 to 94) appears to be well 
conserved, suggesting a structural role. Nucleotides 
that are very highly conserved in the biotin binder but 
not involved in the pseudoknot base pairing (...5'" 
'5cGAAAAG^°i-3'...) are retained in the self-alkylating 
30 enzymes but with a highly conserved change to ...5'" 

'5cGUAAAG^°^-3' . . . These results suggest that the change 
in function from biotin binding to alkylation of RNA with 
BIE is achieved by major structural rearrangements. 

Further analysis of the BL8-6-derived sequences 
35 suggested a clover leaf structure with several remarkable 
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similarities to tRNA (FIG. 12B) • The sequence .S'- 
^^ACGUAAA^^-a ' . . . is presented as the tRNA variable stem, 
flanked on either side by extended duplexes (as indicated 
by several observed Watson-Crick covariations) . The 
5 single guanosine in the variable stem serves as the 
internal alkylation site for the enzyme. One 
interpretation of these results is that the 
hexanucleotide segments CX3AAAA and CGUAAA directly 
mediate the interaction with biotin in the biotin binder 

10 and the biotin ligator respectively, although they are 
presented in strikingly different secondary structiire 
contexts. Comparison of ribozyme sequences from the 
third and eighth rounds of reselection suggest that the 
increase in pool alkylation activity is achieved by 

15 -optimization of Watson-Crick base pairing in the 

cloverleaf duplexes and an increased fraction of purines 
(particularly adenosine) in the loop that caps helix 3. 

To test the cloverleaf model for the biotin 
ligator, a synthetic ribozyme was designed by modifying 

20 one of the re-selected sequences such that 1) primer 
sequences at the 5'- and 3'- ends not involved in the 
cloverleaf were deleted; 2) non-conserved bulges in the 
putative helices were removed, and 3) the variable loop 
of approximately 45 nucleotides was replaced by a three 

25 nucleotide loop sec[uence. The predicted lowest energy 
structure for the resulting 99-nucleotide molecule is 
shown in FIG. 13. This highly simplified structure has 
-10 fold lower activity than the best re-selected clone, 
but is still -10 fold more active than the original BL8-6 

30 ribozyme, thus supporting the proposed cloverleaf 
structure (FIG. 13) . 

Two-component ribozvme; For a ribozyme to properly 
qualify as an enzyme, it must emerge from the catalyzed 
reaction unmodified. The self -alkylating ribozyme, which 
35 has been selected to covalently modify its own active 
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site, fails to meet this requirement* The cloverleaf 
secondary structure, however, immediately indicates a way 
to engineer the ribozyme into two self -associating parts, 
one of which (BL-S) can function as a substrate for 
5 biotinylation while the other (BL-E) acts as a true 
enzyme (FIGS, 14 A and 14B) . A low level of BL-S 
biotinylation, corresponding to the non-catalyzed rate of 
alky lat ion was observed in the absence of BL-E. The 
initial rate of biotinylation of the RHA substrate 

10 increased linearly with increasing concentrations of BL- 
E, although the concentration of product never exceeded 
the concentration of enzyme. This result indicates that 
the two RNA pieces can associate with the BIE substrate 
to form a ternary complex capable of true catalysis. The 

15 extensive Watson-Crick base-pairing that drives complex 
formation most likely prevents dissociation of the 
biotinylated product and thus limits the enzyme fragment 
to a single catalytic event. Destabilizing the enzyme- 
substrate duplexes should make it possible to form a 

20 kinetically reversible complex that will dissociate after 
substrate biotinylation, allowing multiple rounds of 
turnover. 

Nucleic acids produced by the method of the 
25 invention can be used as in vitro or in vivo catalysts. 
In some cases the nucleic acids may be used to detect the 
presence of the ligand. For example, the nucleic acid 
may bind the ligand and catalyze a reaction which 
converts the ligand into a readily detectable molecule. 
30 The ribozymes created by the method of the invention can 
also be used in assays to detect molecules modified by 
the ribozymes which are not themselves ligands, e.g., an 
RNA phosphorylated by a polynucleotide kinase ribozyme. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATIONS 

(i) APPLICANTt Szostak, Jack W. 

Lorach, Jon R. 
Wilson r Charles 

(il) TITLE OF INVENTION: NOVEL RIBOZYHES AND NOVEL RIBOZYMB 

SELECTION SYSTEMS 

(ill) NUMBER OF SEQUENCES: 91 

<iv> CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson 

(B) STREET: 225 Franklin Street 

(C) CITY: Boston 

(D) STATE: Massachusetts 

(E) COUNTRY: U.S.A. 

(F) ZIP: 02110-2804 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

<C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30B 

<vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/299,498 

(B) FILING DATE: Ol-SEP-1994 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Clark, Paul T. 

(B) REGISTRATION NUMBER: 30,162 

<C> REFERENCE/DOCKET NUMBER: 00786/245001 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 542-5070 

(B) TELEFAX: (617) 542-8906 

(C) TELEX: 200154 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSx single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
GGAACCUCUA GGUCAUUAAG A 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTIONS SEQ ID N0i2: 
ACGUCAGAAO GAUCCAAO ^® 
(2) INPOBMATION FOR SEQ ID NO»3s 

<i> SEQUENCE CHARJlCTERXSTICSt 

(A) LBNGTHt 110 base pairs 

(B) tnrPEt nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIONS SEQ ID N0t3i 

GGAACACTAT COGACTGGCA NNNNNNNNNN NNNNMNNNNN NNNNNNNNNM NNNNNNNNNN 60 

NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNCCTTGGTC ATTAGGATCG HO 

(2) INFORMATION FOR SEQ ID N0s4i 

(i) SEQUENCE CHARACTERISTICS s 
<A) LENGTHS 32 base pairs 

(B) TYPES nucleic acid 

(C) STRANDEDNESS: Single 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIONS SEQ ID NOs4s 

GGUGGGAA6A AACUGCAGCU UCGGCUGGCA OC ^2 

(2) INFORMATION FOR SEQ ID NOs5s 

(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTH: 134 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS S single 

(D) TOPOLOGY s linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID HOtSs 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN OGAGGGAAGA AACUGOGGCA 60 
CCNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNAGDGCCGG CUCGNNNNNN NNNNNNNNNN 120 

134 

NNNNNNNNNN NNNN 

(2) INFORMATION FOR SEQ ID NO: 6s 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6s 
TGATTOGCTA GCACGTCATT GGCTGGTAAC ACATGACACT ATACGAGOGA AAAAACTACG 60 
GCACCCTGGT CCGTTAGGGA CAACGACTAA AGTTAGTGCC CACGGGGCTC GTTCA6GGGG 120 

127 

GGCACGG 
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(2) INFORHMIOK FOR SEQ ID NO: 7: 

(1) SBQUSNCB CHARACTERISTICS s 

(A) LENGTH: 115 base pairs 

(B) TYPE I nucleic acid 

(C) STRANDEDNESSt single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
ACTCTCGCTA CCACCTTATT GGCTCGTAAC ACCTGACACT ATACGA6CGA AAAAACTACC 60 
GCACTCTGGT C06TA0GGGC CATGGACTTA AGATAGTGCC CACGGGGCTC GTTCA 115 
(2) INFORMATION FOR SEQ ID NOiB: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSs single 

(D) TOPOLOGY: linear 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GGACTCACTA CCAOGTTGTT GGCTGGTAAC ACCCGACCCT ATACGAGCGA AAAAACTACC 60 
GCACTCTGGT CCATAOGGGA CTTGGACTAA AGTTAGTGCC CACGGGGCTC GTTCAGGGGG 120 
GGCACG6 

(2) INFORMATION FOR SEQ ID NO: 9: 



127 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
AGACTCACTA GOGCGTTATT CGCTGGTAGC CCCTGACACT ATACAGCGAA AATACTGCGG 60 
CACCCTGGTC CGTACGGGAC ATGGACATTA TGTTAGTGCC CACGGGGCTC GTTCAGGGGG 120 
GGCACGC 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 
GGATATGTTG ATTCGCCCCA GCCTATAAAG TGACTCAATT CGAGGGACGC AACTACGGCA 60 
CCGTCTATCT GAATCGGACG CGGAACTTGT GCCGTCTCTA CTCTAACGTT AGCGGAAAAC 120 
GTCGGTTGCG ^^3^ 
(2) INFORMATION FOR SEQ ID NO: 11: 
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(1) SEQUENCS CHARACTEKISTICS: 

(A) 12N6TH3 130 base paira 

(B) TYPBt nucleic acid 
<C) STRANOEDNBSSt single 
(D) TOPOLOGY : linear 

(xi) SBQUENCB DESCRIPTIONt SEQ 10 NOslls 
AGATGTGTC6 ATTCGCCACA GCCAACAAAG CGGCCCAATT CGA6G6AC6C AACTTCGGCA 60 
COOTCTATCA GAAOOGOACG OGGTTCTAGT OCCGTCTCTA TCCTAAOGTT AGCGGAAAAG 120 
GAGGGTTGOQ 

(2) INFORMATION FOR SEQ ID NOsl2t 

(i> SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 130 base pairs 

(B) TYPES nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AGATGTGTTG ATTCGCCTCG 6CCTGTTTAG TGACCAATTT C6AG6GACGC AACTTCGGCA 60 
CCGTCTACCT GCAATAGACG AGGTACTTAT GCAGGCCCTA CTTTAACGTT AGCGGGAAAC 120 
6AGGGTTGCG 

(2) INFORMATION FOR SEQ ID NO: 13: 



130 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
AGTCTACATG GAAGTTGTAC TATCTAAGTG TACTCACCAA AGACGAGGGC AGGAAATAOG 60 
GCACCATTGG CTACGCAAGG CCCAAGTGCC CGGCGTCGTT TCAGAAAGGA TAACGTTAGC 120 
CTG "3 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: Single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:14: 
AGGCCACTTA GATCTOGCAC TATCTAAGCG TACACGCCAA TTAOGAGGGC AGGAAATAC6 60 
GCACCTCCAG CTACGCAAGG CCCCAGTGCC CTGCCTCAGT TCGGAACGGA TAACGTTACC 120 
CTG 1" 
(2) INFORMATION FOR SEQ ID NO: 15: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESSt single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

AGACCTCGTG TAAGTCGTAC TATCTACGAG TGCACACGAA TACGAGGGCA GGAAATACGG 60 

CACCATAACT AOGCAAGOCC CAAOTCCCCO GCCTTGATTC AGAAOGOATA ACGTTAGCCT 120 

« 121 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TTATTTCGTT OGCACCCAGT GATOGCTCGG GACTGGGGCC TCCGCTAGGG AGGACATTGC 60 
GGCACCCAAA CGACCACACA GAAOGTGCTA ACGATAGTGC CGGCTACCAT CCGTGAATGA 120 
ACTGCTGCTG CTGGCG 

<2) INFORMATION FOR SEQ ID NO: 17: 



136 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 135 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:17: 

ACAAGTTGTT OGCACCCAGT GAAOGGTCGG GACTGGGGCC TCCGCTAGGG AGGACATTGC 60 

GCACCCCAAC TATCACTCA6 AACGTGCTAT OGATATAGCC GGCTAGCACC TGATTATGAA 120 

CTGCTGCTGC TGGCG 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GGATATTGTT CGCACCCTGC GATOCCTTGG GACTGGGGCC TCCGCTAGGG AGGACATTGC 60 

GGCACCCAAA CTATCACTCA GAACGTGCTA ACGATAGTGC CGGCTAGCTT CTGTAAGTGA 120 

ACTGCTGCTG TTGG06 236 

(2) INFORMATION FOR SEQ ID NO: 19: 
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(i) SEQUENCE CKARACTERXSTICSt 

(A> LENCTHt 137 base pairs 

(B) TYPSx nucleic acid 

(C) STRANDSDNESSt single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
AGACCTTAAT TCGAAAGCGT ATTCAACTTA CCATATCTCG CGCCGAGGGA AGGACCATCO 60 
60GCCAACTA CAGAGCOOTO OTTA6CGGAC TCCGCAOTGC CGGCTCOGGG AATAGGGTTC 120 
TCAC6AATTA COGGCAT ^^"^ 
(2) INFORKATIC»l FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSt single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
AGGCCTTAAT TCGAAAGCGT ATTOGACATA CCATATTTTG CGCCGAGGGA AGAT0CTT06 60 
GCACAGACTA CAGCGTC6AG GTGAGCGGCG CACACTGTGT CGGCTCOGGG AATAGGGTTC 120 
TCACGAATTA CCGGCAT 137 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0t21: 
AGATGTGGTT GCATAGTAGG CAGCCGGGCA CTTACGCCGA ATCGAGGGAC GAGACCGGA6 60 
CACCACGATG COCCGCGATA CCTCATTTGG GATTAGTGCC GGCTAGGAAA GTGAGTTCCT 120 
TATGACCTGC CTCCAC 136 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
AGATGTGGCG GCATAGTAGG CAGCCGAGCA CTAA06CCAA ATCGAAGGAC GA6ACTGCGG 60 
CTCCACGATG CGCOGCGATG CCACTTTTGA GATTAGTACC GGCTGGGAAA GTGAATTCCT 120 
TCTGGCCTGT CTCCAC 136 
(2) INFORMATION FOR SEQ ID NO: 23: 
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(1) SEQUENCE CHARACTERZSTXCSt 

(A) LENGTH: 137 base pairs 

(B) TYPEt nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY I linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23< 
A6AT06ATTG GAGACGCCCT GGCGTACTTT AGGTA6AAAA CTCCGACGGA AAAAACTGCG 60 
GCACC6TG6G AGTAGAOGAT AGATAACAGG 6CATTAGTGC CGGCCTCGCA AAGCTACCAT 120 
GA6ATGGAGC GATCAGG X37 
(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTH: 137 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CGTTAGATTG GAAGC6CCCC GACTTACTTT AGCTTGAAAA CTCCGACGGA AAAACTACAG 60 
CACOCTGGGA GTAGAGGATC GGATATCAGG CATTAGTGCC GGCCTCGTAA AGCTACCAGG 120 
ATATTGGGAC GATCAGG 3^37 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

CGAGGGAAGA AAAUGCGGCA CCAGUGCCGG COCO 34 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 41 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

ACGAGCGAAA AAACUACGGC ACUAGUGCCC ACGGGGCUCG U 41 

(2) INFORMATION FOR SEQ ID NOt27: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
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CGAGGGCAGG AAAUAOGGCA CCAGUGCOCG 6CCUU6 36 
(2> INFORMATION FOR SEQ ID NOt28s 

(1) SEQUENCE CBARACTERISTICSi 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNSSSs Single 

(D) TOPOLOGY: linear 

(Xi> SEQOENCE DBSCRXPTZONt SEQ ID NOi28: 
6CDAGG0A60 ACAUUGC6GC ACCA6UGC0G GCUAGC 36 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) 5TRANDEDNSSS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
CCGAGGGAAG AUCCUUCG6C ACAUGUGUCG 6CUC6G 36 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GGAACCUACG AGCGAAAAAA CUAC6GCACU CUGGUCCAUA CGGGACUU6G ACUAAAGUUA 60 
GUGCCCACGG 6GCU06UUCA AG6UUCUCAC GG 92 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:31: 
ACGAGCGAAA AAACUAC6GC ACUCUGGUCC AUACG6GACU UGGACUAAAG UUAGUGCCCA 60 
CGGGGCUCGU UCAAGGUUCU CAOGG 85 
(2) INFORMATION FOR SEQ ID NO:32: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 
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GCAACACTAT CC6ACTGGCA CCNNNNNNNN NNNKNNNNWN NNNNNNNNNN NKNNNNNNNN 60 

NNNNNNNNNN NKHimNNNmi HNNNNNNNNN NNNNCCTTCG TCATTAGOAT CG 112 

(2) INPORMATIOH FOR SBQ ID NOt33t 

(1) SBQtlENCS CHARACTERISTICS t 

(A> LENGTHx 156 base pairs 

(B) TYPBt nucleic acid 

(C) STRANDBDNESSt single 

(D) TOPOLOOirs linear 

(xi) SBQDENCB DESCRIPTION t SEQ ID KOt33: 
GGAGCCAOCA CGGTOGGATC CNMNNNNNNN NNNCGAACAC TATCCGACTG GCAAAGAOCA 60 
TAGGCTCGGG TTGCCAGAGG TTCCACACTT TCATOOAAAA GCCTATCCTA GGCAATCACA 120 
TGGACTNNNN NNNNNimNCC TTGGTCATTA 60ATC0 156 
(2) INFORKATION FOR SEQ ID NO:34< 

(i) SEQUENCE CRARACTBRISTICSt 

(A) LBNOTHt 156 base pairs 

(B) TTPBt nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGYs linear 

<xi) SEQUENCE DESCRIPTIONS SEQ ID NOt34t 
GGAGGCACCA CGGTCGGATC CGCTTTATTA TCATGAGCCC GACTCGACGG GCACTGTACA 60 
TAAGCTTCGG ATGCCATAGT TTAGACACTA TGGACGTAAA GCCCATGCTA GGCAAAGACA 120 
TTGACTGCAT GAGCGCCGCC TTGGTCATTA GGATCG 156 
(2) INFORMATION FOR SEQ ID NO: 35: 

(1) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 

NNNNNNNNNN NNGGAACACT ATCCGACTGG CACCGACCAT ACGCTCGGGT TGOCAGACGT 60 

TCCACACTTT CATCGAAAAO CCTATGCTAG GCAATGACAT GCACTNNNNN NNNNNNH 117 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

<A> LENGTH: 117 base pairs 
(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

TTGCGGTGGG ANGGACCACA TGCCGCCTGG CACCGACCAT AGGCTCGGGT TGCCAGAGGT 60 

TCCACAGTTT CATCGAAAAG CCTATGCTAG GAGGTTACCT AGACTTAGCC GTTCACT 117 
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(2) INFORMATION FOR SEQ ID NOt37s 

(i) SEQUENCE CHARACTERISTICS t 

(A) LBNQTHt 117 base pairB 

(B) TYPEt nucleic acid 

(C) STRANDEDNESSi single 

(D) TOPOLOGYs linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
ATTTCGC6AT GGGGA6CACA TAGCAACTGG CACOGACCAT AGGGTOGGGT TGCAAGAGGT 60 
TCCACACTTT CAT06AAAA6 CCTATGCTAG GCAATGACAT GGACTNNNNN NNNNNNN 117 
(2) INFORMATION FOR SEQ ID HO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
TCTTCGGAGG CCGTTACAGA CAGACACTGG CACCGACCAT AGGCTCGGGT TGTGTGAGGT 60 
TGCCCAT6TT CATCGAAAAG CCTATGCTAG CCACTGACAT GGACTTTATC CACAAGT 117 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 
CAGTTATTCT GCGTAACACA TTCTGACTGA CACCGACCAT AGGCTCGGGT TGCCCTAGTT 60 
GCCACACTTT CAACGAAAAG CCTATGCTAA CCTATGACGT GGACTCCGGC ATGNNNN 117 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
CAAAGGTCCT ACGGAATACA CTCTAACTGA CACOGACCAT AGGCTCGGGT CTCCAAAGGT 60 
GCCACATTTT CAGCGAAAAG CCTATGCTAT CCAATGGCAT GAAGTATCAC GTCTACT 117 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DBSCRIPTIONt SBQ ID NOt41t 
mCGCMCGT CCTGAAGACA TTCCCACTTC GACCGACCAT AGGCTCGGOT TCCGAAACTT 60 
OTCTCACATT CTTTGAAAAO CCTATCCTAC CTAGTOACAA GGATTACGCC C3GCTGA0 117 
(2) INFORMATION FOR SBQ ID NOs42t 

(1) SBQOSNCB CHARACTERISTICS I 

(A) LENGTH t 117 base pairs 

(B) TYPBs nucleic acid 

(C) STRANDEDNBSSs single 

(D) TOPOLOGYt linear 

<xi) SEQUENCE DESCRIPTION » SBQ ID NO: 42 s 
ACGTCCGCCA ACGGTGGACA TTCTGACGGC GACCGACCAT AGGCTCGGOT TG6CCCCGGT 60 
TTCATACTTT CATTGAAAAC CCTATGCCAG GCAGTGACAT GAACTTTGAG GTAAAGT 117 
(2) INFORMATION FOR SBQ ID N0s43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) USNGTH: 117 base pairs 

(B) TyPE: nucleic acid 

(C) STRANDEDNESSt single 
(O) TOPOLOGyi linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 43: 
CCCTGTTAAA GAGGAACACA TTCC6ACTGC TACCGACCAT AGGCTCGGOT TCGTTGAGGT 60 
GCCACACATG CATTGACAAC CTTATGCTAG GGGTTCCCAT GCACTNNNNN NNNNNNN 117 
(2) INFORMATION FOR SBQ ID NOi44: 

(i) SEQUENCE CHARACTERISTICS: 
.(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO:44: 
CAA6AACCGG CCGAAAAACA TTCCAACTGG TACCGACCAT AGGCTCGGOT TCCCAGACAT 60 
TACACATTTT CTTTGAAAAO CCTATGATAT CCGCTGACOG TGACCGCTAG CGGCATC 117 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
TGGACTTTTC AOGGAACATG TTCCGATTGG CACCGACCAT AGGCTCGGCT TTCCAGAGGT 60 
GCCACAACTT CATTGAAAAG CCTATGCTAG CCAATGACCT GGACCATCAC AAAGGTT 117 
(2) INFORMATION FOR SEQ ID NO: 46: 
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(1) SBQUENCB CHARACTEKISTXCSt 

(A) LENGTHt 117 base pairs 

(B) TTPEx nucleic acid 

(C) STRANDBDNESSt single 
(0) TOPOLOGYt linear 

(Xi) SBQUENCB DESCRIPTIONi SEQ ID NOt46« 
CrrCATTAAA OGGGAAAACA TTCCGACTGG GACCGACCAT AGGCTCGOTT TTTCAGAAGG 
CACTCT6TTG OGTC6ACAAG CCTATGCTGG ACCATGACCT GGACTATTTG CCCAGAT 
(2) INFORMATION FOR SEQ ID NOt47j 

(i) SBQUENCB CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TlfPEt nucleic acid 

(C) STRANDEDNESSx single 

(D) TOPOLOGy: linear 

(xi) SEQUENCE DESCRIFTION: SEQ ID NO: 47: 
TGATGAGAGC TACGAACACA CACCGACT6G CACOGACCAT AGGCTCGOTT TGCCTCAGAT 60 
TCTTACCTTT CTTTGAAAAG CCTATGCTTG CTAATGACCT GGATTTGAGA ACANNNN 117 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
GACACAAAGC AGGCAACAAA TTCCGACTGG TACCGACCAT AGGCTCGOTT TGCCCGACCT 
TCCACACTTT CATCGAAAAG CCTATGTTAG CTAATGACAG GGAGGACTCG ATGTGGT 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
COGAGCGGTC GGGGACGACA TTCCGTCTGG CACCGACCAT AGGCTCGOTT CTCCAGAGCT 60 
TCCAAACCTT CTTGGAAAAG CCTATGCTGG GCAATGACAT GGACTNNNNN NNNNNNN 117 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
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AGTOTCATAT TAGGGACACA CTCC6TAT0G CACCGATCAT AGGCTOGGTT TGGCACGCGT 60 
GCCACACTTG CAACGACAA6 CCTATGGTAG TCCATAACCT GGACTACAAA CXICOATT 117 
(2) INFORMATION FOR SEQ ID NOiSlt 

(1) SEQUENCB CHARACTERISTICS t 

(A) LENGTHS 117 base pairs 

(B) TlTPEt nucleic acid 

(C) STRANDEDNBSSt Single 

(D) TOPOLOGY s linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NOiSlt 
CCCTAGTG6A TA6GAACACA TTAOGCCTGG CACCGACCAT AGGCTOGGTT GACCAGCGTT 60 
TCCACACTTT CATCGAAAAG CCTATGCTTG CCATTGACAT GGACTCACGC ATTGCAT 117 
(2) INFORMATION FOR SEQ ID NO s 52s 

(1) SEQUENCE CHARACTERISTICS s 

(A) LENGTHS 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

GTGCCGACTT ACGGTTCACA TTCAAACTGG CACCGACCAT AGGCTCGGTT TCCCTAACCT 60 

TTCAAACTTT CATCGAAAAG CCTATGCTGG GCAAOGGTTA GGGTTTCGCA CGGC6AT 117 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCB CHARACTERISTICS: 

<A) LENGTH: 117 base pairs 
<B) TYPE: nucleic acid 
{C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

CTGCACAGGT AGGGAACGCA TTTCGACTCC CACCGACCAT AGGCTOGGTC AGCGAGTTGC 60 

GCCCCAATTT CAACGAAAAG CCTATCCTAO GTAATGCCAT GGACTGGTTC GTATCAT 117 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: 
GACGGAACCG TTTTAACACG TTCCGACCGG CACCGACCAT AGGCTCGGTT TGCCAGAGCT 60 
TCACAACTTT CATCGAAAAG CCTATGAAAT GTAACGACAA GGACTACTCG ACCAGCA 117 
(2) INFORMATION FOR SEQ ID NO: 55: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LBHOTHt 117 baaa pairs 

(B) TYPES nucleic acid 
(C> STRAHDBDNESSt single 
(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTIONt SEQ ID NO: 55: 
CGCTOGTCGC GCCGAACAAA TTCCCACAGG CACOGACCAT AGGCTCGGTT TGCCTGTTGC 
TCCACACCTT CATC6AAAAG CCTATOCCCG GCAATCACTT GGCCTTTGGA CGTCATT 
(2) INFORMATION FOR SEQ ID NO: 56 8 

(i) SEQUENCB CHARACTERISTICS: 

(A) I^JGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
GCTCTGTTCG GTTCAACAAA TTCACACTGG CAAAGACCAT AGGCTCGGTT TGCCAGAGGT 
GCCACAGTTC ACTCCAAAAG CCTATGATCG CCAATGACAT GTACCTCAOG CTAGGCA 
<2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 117 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
ACACTATGTA CTGGAAAACG TTCGGACACA CACCGACTAT AGGCTCGGTT TGCCATTCGT 60 
GCCACAGTTC CAGOGAAAAG OCTATGOGGG GCCATGACAC GTACTGCCCA GTAACGT 
(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCB CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
TGCTACTGTT ATGTAACACA TTCCGACTGC GACCGACCAT AGGCTCGGTT TTCCAGACGT 
TCGTCACTTG CTTCGACAAG CCTATGAAAT TCAATGACAT GGCCTGGCTA GGCG06A 
(2) INFORMATION FOR SEQ ID NO: 59: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
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TCTATCGCOO TGCAAACACA CTAOGTCTGO CCCCGACCAT AGGCTOGGCT TGCCAGCGTT 60 
TGCAACGTTT CATCGAAAAO CGTATGCGAT CTAATGACAT CGACCGGAAG GCCCAAT 117 
(2) INFORMATION FOR SEQ ID NO: 60s 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTHS 117 base pairs 

(B) TYPES nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY s linear 

(xi) SBQXmNCE DESCRIPTIONt SEQ ID NOt60s 

CTAAATTTGG TTCAAACACA TCCAGACTGG CCCCGACCAT A6GCTC6GGT TGTCAGACGT 60 

GCTTCACOTT CCTCGAAAAG CCTATGTGAT GGAAT6ACAT TGACTGAGGG ATGCGGT 117 

(2) INFORMATION FOR SEQ ID N0t61t 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS t single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
GCNGAGGGCT CCGGTACACA TGCAGACTGG TCCCGACCAT AGGCTOGGCT TACCAGACCT 60 
TCAACTACTT CTTCGAAAAG CCTATGCCGG TCAAGGCCAT GAACGCTCAA TCAGTCT 117 
(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
TGTCCGAACG ACGTATGCCA TTCCGTCTGG CCCCGACCAT AGGCTCGGAT TACCATTCGT 60 
TACACACTTT CATCGAAAAG CCTATGCTGT TCAATGGCCC GQACTTCAGT AGATGGT 117 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
CGGAATTACA CTGGATCACA TCCCGACTGG CCCCGACCAT AGGCTCGGGT TGCCAGTGCT 60 
TACACCCTTT CACCGAAAAG GCTATGCTAG GCCATGCCAT TAACTNNNNN NNNNNNN 117 
(2) INFORMATION FOR SEQ ID NO: 64: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LBNGTHs 117 base pairs 

(B) TYPES nucleic acid 

(C) STRANDSOMSSSt single 

(D) TOPOLOGIT: linear 

(xi) SEQUENCE DESCRIPTION i SEQ ID Naj645 
GGTTTATTAT CATGAGCCCG ACTCGAOGGG CACTGTACAT AAGCTTCGGA TGCCATAGTT 60 
TAGACACTAT GGACGTAAAG CCCATGCTAG GCAAAGACAT TGACTGCATG AG0GCC6 117 
(2) INFORMATION FOR SEQ ZD NO: 65: 

(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTH t 117 base paire 

(B) TYPES nucleic acid 

(C) STRANDEDNESSs single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTIONS SEQ ID NOs65s 

GGTTTATCAT GTTTTAATCC CTACGC6CTC ACATTTGAAT AACCCGGGAA TTACAGAGTG 60 

TAAACACTAT GAACGTAAAG ACCATGCGAA GCTATGACAC TGACTGCATG 6TCG0GG 117 

(2) INFORMATION FOR SEQ ID NO 1 66: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTHS 117 base pairs 
<B) TYPE: nucleic acid 
(C> STRANDEDNESSs single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

GGGGTTTTTG TC60GGACCC TCGOGACGTT CACTGTACAT AAGCTTCGGA TGCOGTAGAG 60 

TAAACACTGC GGACGTAAAG CTCATGTTGG GTATTAAACC AAACAACATT AGCCCCG 117 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSs single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67s 
AGCTTCTCAT CAGTCGGTCC CACTCCACCG ACATTTACGT AACCTTTGGA TGCCATAGTA 60 
AAAACACTAT GGACGTAAAG CGCAACOTAG CCCAAGATAT TGACAGTTTG AGCGCOG 117 
(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSs single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NOs68: 
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GTTTTATGTT CAAGTGCCCC AAAOGGCOGG CACTCTACAT AACCCTCGCA TGCAATAOTC 60 
TAGAOGCTAT TGGTGTAAAG CCCATATTAG ACAAGGAOCT TGTCTTCATG AGOGCCXS 117 
(2) INFORMATION FOR SEQ 10 NO: 69: 

(i) SEQUENCE CKARACTSRISTICSt 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSSs single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:69: 
GTTTTAGCAT TGTGAGCCOC GCTCCACGGT CACTCTGAAG ATGCTTCGOA TGCCATAGTT 60 
CGCACACTAT GGACGTAAA6 ATTGTTCGAO TCACAGACAG TAGCTGCACA ATCGCOG 117 
(2> INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
CGTTGAAATA AGCGTTAGGC CTACTTGACG CTCA6TAGGC AATCACCGGA TGCCGTAGTT 60 
TATACACTAT GGACGTAAAG GTCATGCTGT TCTAAGACAT TGTCTGCATG ACCGCOG 117 
(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
GAAATTTTGT GTGCAGACAC TACTCTCCTG CACCGTTTAA AAGCTT06GA TGCCATAGGT 60 
TAAAAACTAT GGACGTAAAG CGCATGATCG GTAAACACAG TTACTGCATG ATCGCCG 117 
(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
GGTTTATCAT GTTTTAATCC CTACGCGGGT CACATTTGAA TACCCGGGAA TTACAGAGTG 60 
TAAACACTAT GAACCTAAAG ACCATGCGAA GCTATGACAC TGACTCCATG GTCGCGG 117 
(2) INFORMATION FOR SEQ ID NO: 73: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 117 base pairs 

(B) TYPBt nucleic acid 

(C) STRANDBBNBSSt single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION 8 SEQ ID NOi73: 
OGTTTATCAT GTTTTAATCC CTACGOGGGT CACATTTGAA TACCGCGCAA TTACAOACTG 60 
TAAACACTAT GAACGTAAAG ACCATGCGAA GCTATGACAC TGACTGCATG OTCGCGC 117 
(2) INFORMATION FOR SEQ ID NO: 74: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESSi single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74s 
GGTTGAAAAA CATGAGCCAG TCTCGAC6AG ACTTCTCGTT TCTAATCGGA TGCCATAGTT 60 
AAGATACTAT GGACGTAAAG CGCTCGGTAG CTAACAACAG TGTTTGCCAG CGCGCCG 117 
(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75: 
GGATTGTTAT ACCTTGGCCT GGATCCTAGC CACTGTAGCT ATCATCCGGA TOCGAGAGTT 60 
TAGCCACTCT GGACGTAAAG CTCATGTTAA GAATAGACAT TGAATGCATG AGCGCCC 117 
(2) INFORMATION FOR SEQ ID NOi76i 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76: 

GATGCATTAT CTCGCGTGCG TGTAGACGGG GTCGACACGC AAGCTTCGGA TGCCATAGAT 60 

TAGATACTAT GGACGTAAAG CTCATGTTAG TCAAAAACAC TGGCTCCATG AGCGCCG 117 

(2) INFORMATION FOR SEQ ID NO: 77 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
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GGAAAATCAT ATAA0TCC06 TCGCCC06CG AACTTTACX3T AAGATTCGGA T6CCATAGTT 60 
TATCCACTAT OGGTGTAAAO 6TCATGCTAT ACCAACACAT TTAT6GCAT0 ATOOCCG 117 
(2) INFOKMATION FOR SEQ ID NOt78: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LBNGTHt 117 base pairs 

(B) TYPE} nucleic acid 

(C) STRANDEDNESSt single 

(D) TOPOLOGYi linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
GAAATTTTGT GTGCAGACAC TACCCTCCTG CACCGTTAAA AAGCTTCGGA TGCCATAGGT 60 
TAAAAACTAT GGAC6TAAAG CGCATGATOG GTAAACACAG TTACTGCATG TGOGCCG 117 
(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TTPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
OCTGTATTAC CTTGAGTCCA ACTCCAOGAG CACTATGAAT AATCTTCGGA TGCCATCGTT 60 
TCAACAC6AT GGACGTAAAG CCCACTGTTG GCAAATACAT TGACTGCAGG TGCGCOG 117 
(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
GATGCATTAT CTCGOGTGCG TGTAGACGGG GATC6ACACC AAGCTTCGGA TGCCATAGAT 60 
TAGACACTAT GGACGTAAAG CGCATGTTAG TAGAAATCAA CTGCAGCACG ACCGCCG 117 
(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
GAAATTTTGT GTGCAGACAC TACTCTCCTG CACOGTTTAA AAGCTTCGGA TGCCATAGGT 60 
TAAAAACTAT GGACGTAAAG CGCATGATCG GTAAACACAG TTACTGCATG TGCGCCG 117 
(2) INFORMATION FOR SEQ ID NO: 82: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 117 baflft pairs 

(B) TYPE I nucleic acid 

(C) STRANDEDNBSSt Single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
GATTTATTCA TAT6AGCCGG GTTGAAAGTA TAAAGTACTT TAGCTTC6GC T6CCAAAGTT 60 
TATAAACTTT GGACGTAAAG CTCCTGCTTG GCAAATACAA AAGCTGCAC6 AGCGCCA 117 
<2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
GGTTACTTAA T6CGACCAAC CTACGGGGCA CTGTCTACAT AAGTTTC6GA T6CCATA6TG 60 
ATGCAACTAT GGACGTAAAG CCCATGCCAG ACTAAAACAT TGTCTGCATG CGCGCCG 117 
(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
GGAGTCTTTT CATCAGTCCG ACTCTOCACT CATTGTTCAT AAGCTCCGGA TGCCATAGCT 60 
CAAAAACTAT GGACGTAAAG CCCATCCTAA GCTCTCAAGT TCACFGCAT6 AGCGCCG 117 
(2) INFORMATION FOR SEQ ID NOl85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
GATTTATTCA TATGAGCCGG GTTCAAAGTA TAAAGTACTT TAGCTTCGGC TGCCAAAGTT 60 
TATAAACTTT GGACGTAAAG CCCATGTTAG GTAAGATTAT TAACAGCATG TGCGCC6 117 
(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:86: 
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GCTTTATTCT CTCTTGCCCT GATCCACGGQ CAGGATACGA GGGATGCOGA TGCCATATTT 60 
TAAAAAOTAT GGACGTAAAG CCCATGATAA GCAAAGATTG TCACATCATG TGOGCCG 117 
(2) INFORMATION FOR SEQ ID NOt87t 

(1) SEQUENCE CHARACTERISTICS s 

(A) LENGTBt 99 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOOyj linear 

(xi) SEQOBHCE DESCRIPTION} SEQ ID NOsS?: 
GGAACCAAGG CGGAOCOGGA OGAGAUCCGG AOGCCAUAGO AAAAACAC0A UCGACGUAAA 60 
GCUCAGGCUG AAGACACA6C CUGA6CGCC6 CCOUGGOUC 99 
(2) INFORMATION FOR SEQ ID NOs88t 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TVPEt nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:88: 
GGAACACUAU GGACGUAAAG CUCAGGCUGA A 
(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
GGAAACAGCC UGAGCGCCGC CUUGGUUCGA AAGAACCAAG GOGGAUCCGG AUGAGAUCCG 60 
GAUGCCAUAG UAA 

(2) INFORMATION FOR SEQ ID NOi90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0l90: 
GGAACACUAU COGAUGGCAC CGACCAUAGG CUCGGGUUCC CAGAGCUUCC ACACUUUCAU 60 
CGAAAAGCCU AUGCUAGGCA AUGACAUGGA CUCCUUGGUC AUUAGGAUCG no 
(2) INFORMATION FOR SEQ ID NO: 91: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNBSSt Single 

(D) TOPOLOGY t linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO:91t 
GGAGGCACCA COGCUGGAUC CGGOUUAUUA UCAUGAGCCC GACUCGGGCA GCACUGUACA 
OAAGCUCCGA 0GCCAOAGUU UAGACACUAU GGACGUAAA6 CCCAU6CUAG GCAAA6ACAU 
UGACOGCAUG AGCGCCGCCU UGGOCAUUAG 6ADCG 
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CLAIMS 

1. A method for producing a catalytic RNA 
molecule capable of binding a first ligand and catalyzing 
a chemical reaction modifying said RNA molecule, 
5 comprising the steps of: 

a) providing a first population of RNA molecules 
each having a first region of random sequence; 

b) contacting said first population of RNA 
molecules with said first ligand; 

10 c) isolating a first ligand-binding 

subpopulation of said first population of RNA molecules 
by partitioning RNA molecules in said first population 
which specifically bind said first ligand from those 
which do not; 

15 d) amplifying said first ligand-binding 

subpopulation in vitro ; 

e) identifying a first ligand binding sequence; 

f) preparing a second population of RNA 
molecules each of said RNA molecules comprising said 

20 first ligand binding sequence and a second region of 
random sequence; 

g) contacting said second population of RNA 
molecules with a second ligand capable of binding said 
first ligand binding sequence; and 

25 h) isolating a subpopulation of said catalytic 

RNA molecules from said second population of RNA 
molecules by partitioning RNA molecules which have been 
modified in step g) from those which have not been 
modified. 

30 2. The method of claim 1, wherein said first 

ligand is ATP or biotin, 

3. The method of claim 1, wherein said second 
ligand serves as a substrate for said chemical reaction. 
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4. The method of claim 1, wherein said catalytic 
RNA molecule can transfer a phosphate from a nucleotide 
triphosphate to said catalytic RNA molecule. 

5. The method of claim 4, wherein said transfer 
5 is to the 5' -hydroxy 1 or to an internal 2'-hydroxyl of 

said catalytic RNA molecule. 

6. The method of claim 1, wherein said catalytic 
RNA molecule can transfer a phosphate from a nucleotide 
triphosphate to a nucleic acid other than said catalytic 

10 RNA molecule. 

7. The method of claim 6, wherein said nucleic 
acid is a ribonucleic acid. 

8. The method of claim 1, wherein said first and 
second ligands are the same. 

15 9. The method of claim 1, wherein said catalytic 

molecules can catalyze N-alkylation. 

10. A catalytic RNA molecule which can transfer a 
phosphate from a nucleotide triphosphate to said 
catalytic RNA molecule. 

20 11. The catalytic RNA molecule of claim 10, 

wherein said transfer is to the 5'-hydroxyl or to an 
internal 2'-hydroxyl of said catalytic RNA molecule. 



12. A catalytic RNA molecule which can transfer a 
phosphate from a nucleotide triphosphate to a nucleic 
25 acid other than said catalytic RNA molecule. 
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13. The catalytic RNA molecule of claim 12, 
wherein said nucleic acid is a ribonucleic acid. 

14. A catalytic RNA capable of catalyzing N- 
alkylation. 

15. A method for producing a catalytic RNA 
molecule capable of binding a first ligand and catalyzing 
a chemical reaction modifying a first substrate molecule 
bound to said catalytic RNA molecule, comprising the 
steps of: 

a) providing a first population of RNA molecules 
each having a first region of random sequence; 

b) contacting said first population with said 
first ligemd; 

c) isolating a first ligand-binding 

. subpopulation of said first population of RNA molecules 
by partitioning RNA molecules in said first population of 
RNA molecules which specifically bind said first ligand 
from those which do not; 

d) amplifying said first ligand binding 
subpopulation in vitro ; 

e) identifying a first ligemd binding sequence; 

f ) preparing a second population of RNA 
molecules each of said RNA molecules comprising said 
first ligand binding sequence and a second region of 
random sequence, each of said RNA molecules being bound 
to said first substrate molecule; 

g) contacting said second population of RNA 
molecules with a second ligand capable of binding said 
first ligand-binding sequence; and 

h) isolating a subpopulation of said catalytic 
RNA molecules from said second population of RNA 
molecules by partitioning RNA molecules which are bound 
to a substrate molecule which has been modified in step 
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g) from those RNA molecules which are bound to a 
substrate molecule which has not been modified in step 

g). 

16. The method of claim 15, wherein said second 
5 ligand serves as a second substrate for said chemical 
reaction. 
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